1 atgggaggag gacagtcaat aatgaagcaa tttaaaagta taattaacac gtcgcaggac

61 tttgaaaaaa gaatagaaaa gataaaaaaa gaagtaatca atgacccaga tgttaagcaa 

121 tttttggaag cgcatcgagc tgaattaacg aatgctatga ttgatgaaga cttaaatgtg 

181 ttacaagagt ataaagatca acaaaaacat tatgacggtc ataaatttgc tgattgtcca 

241 aatttcgtaa aggggcatgt gcctgagtta tatgttgata ataaccgaat taaaatacgc 

301 tatttacaat gcccatgtaa aatcaagtac gacgaagaac gctttgaagc tgagctaatt

361 acatctcatc atatgcaacg agatacttta aatgccaaat tgaaagatat ttatatgaat 

421 catcgagacc gtcttgatgt agctatggca gcagatgata tttgtacagc aataactaat 

481 ggggaacaag tgaaaggcct ttacctttat ggtccatttg ggacaggtaa atcttttatt 

541 ctaggtgcaa ttgcgaatca gctcaaatct aagaaggtac gttcgacaat tatttattta 

601 ccggaattta ttagaacatt aaaaggtggc tttaaagatg gttcttttga aaagaaatta 

661 catcgcgtaa gagaagcaaa cattttaatg cttgatgata ttggggctga agaagtgact 

721 ccatgggtga gagatgaggt aattggacct ttgctacatt atcgaatggt tcatgaatta 

781 ccaacattct ttagttctaa ttttgactat agtgaattgg aacatcattt agcgatgact 

841 cgtgatggtg aagagaagac taaagcagca cgtattattg aacgtgtcaa atctttgtca 

901 acaccatact ttttatcagg agaaaatttc agaaacaatt ga 



Figure 1B
SEQ ID NO: 2 

1 MGGGQSIMKQ FKSIINTSQD FEKRIEKIKK EVINDPDVKQ FLEAHRAELT NAMIDEDLNV 

61 LQEYKDQQKH YDGHKFADCP NFVKGHVPEL YVDNNRIKIR YLQCPCKIKY DEERFEAELI

121 TSHHMQRDTL NAKLKDIYMN HRDRLDVAMA ADDICTAITN GEQVKGLYLY GPFGTGKSFI 

181 LGAIANQLKS KKVRSTIIYL PEFIRTLKGG FKDGSFEKKL HRVREANILM LDDIGAEEVT 

241 PWVRDEVIGP LLHYRMVHEL PTFFSSNFDY SELEHHLAMT RDGEEKTKAA RIIERVKSLS 

301 TPYFLSGENF RNN 
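
The relationship between the two listings above can be checked by translating the nucleotide sequence and comparing it with the amino acid sequence. The following is a minimal sketch, assuming the standard genetic code applies and that SEQ ID NO: 2 is the direct translation of the preceding nucleotide sequence; the helper names clean and translate are hypothetical and are not part of the original listing.

```python
from itertools import product

# Standard genetic code, codons ordered t, c, a, g at each position.
# Assumption: the standard code is appropriate for this sequence.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(listing: str) -> str:
    """Drop position numbers, spaces and line breaks from a numbered listing."""
    return "".join(ch for ch in listing.lower() if ch in BASES)


def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the nucleotide listing, copied with its position number.
first_line = "1 atgggaggag gacagtcaat aatgaagcaa tttaaaagta taattaacac gtcgcaggac"
print(translate(clean(first_line)))
```

Run on the first 60 bases, this yields the first 20 residues shown in line 1 of SEQ ID NO: 2. Passing the whole nucleotide listing to clean and translate extends the comparison to the full sequence.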
Figure 2A

SEQ ID NO: 3 Complete genome sequence of bacteriophage 77 

1 gatcaaaata cttggggaac ggttagggag taaacttcgc gataatttta aaaattcatg 

61 tataaccccc ctcttataac cattttaagg caggtgatga aatggagatt atagtcgatg 

121 aaaatttagt gcttaaagaa aaagaaaggc tacaagtatt atataaagac atacctagca 

181 ataaattaaa agtagttgat ggtttaatta ttcaagcagc aaggctacgt gtaatgcttg 

241 attacatgtg ggaagacata aaagaaaaag gtgattatga tttatttact caatctgaaa 

301 aggcgccacc atatgaaagg gaaagaccag tagccaaact atttaatgct agagatgctg

361 catatcaaaa aataatcaaa caattatcgg atttattgcc cgaagagaaa gaagacacag 

421 aaacgccatc tgatgattac ctatgattag taataaatac gttgatgaat atataaattt 

481 gtggaaacaa ggaaagataa ttttaaataa agaaagaatt gatctcttta attatctaca 

541 aaaacatata tattcacgag atgatgtata ttttgatgaa cagaaaatcg aggattgtat 

601 caaatttatt gaaaaatggt attttccaac attaccattt caaaggttta tcatagctaa 

661 tatatttctt atagataaaa atacagatga agctttcttt acagaatttg ctattttcat 

721 gggacgtgga ggcgggaaaa acggtctaat aagtgctatt agtgattttc tttctacgcc 

781 cttacacgga gttaaagaat atcacatctc cattgttgct aatagtgaag atcaagcaaa 

841 aacatcgttt gatgaaatca gaaccgtttt aatggataac aaacgaaata agacgggtaa 

901 aacgccaaaa gctccttatg aagttagtaa agcaaaaata ataaaccgtg caactaaatc 

961 ggttattcga tataacacat caaacacaaa aaccaaagac ggtggacgtg aggggtgtgt 

1021 tatttttgat gaaattcatt atttctttgg tcctgaaatg gtaaacgtca aacgtggtgg 

1081 attaggtaaa aagaaaaata gaagaacgtt ttatataagt actgatggtt ttgttagaga 

1141 gggttatatc gatgcaatga agcacaaaat tgcaagtgta ttaagtggca aggttaaaaa 

1201 tagtagattg tttgcttttt attgtaagtt agacgatcca aaagaagttg atgacagaca 

1261 gacgtgggaa aaggcgaacc caatgttaca taaaccgtta tcagaatacg ctaaaacact 

1321 gctaagcacg attgaagaag aatataacga tttaccattc aaccgttcaa ataagcccga 

1381 attcatgact aagcgaatga atttgcctga agttgacctt gaaaaagtaa tagcaccatg 

1441 gaaagaaata ctagcgacta atagagagat accaaattta gataatcaaa tgtgtattgg 

1501 tggtttagac tttgcaaaca ttcgagattt tgcaagtgta gggctattat tccgaaaaaa 

1561 cgatgattac atttggttag gacattcgtt tgtaagacaa gggtttttgg atgatgtcaa 

1621 attagaacct cctattaaag aatgggaaaa aatgggatta ttgaccattg tcgatgatga 

1681 tgtcattgaa attgaatata tagttgattg gtttttaaag gctagagaaa aatatgggct 

1741 tgaaaaagtc atagctgata attatagaac tgatattgta agacgtgcgt ttgaggatgc 

1801 tggcataaaa cttgaagtac ttagaaatcc aaaagcaata catggattac ttgcaccacg 

1861 tatcgataca atgtttgcga aacataacgt aatatatgga gacaatcctt tgatgcgttg 

1921 gtttactaat aatgttgctg taaaaatcaa gccggatgga aataaagagt atatcaaaaa 

1981 agatgaagtc agacgtaaaa cggatggatt catggctttt gttcacgcat tatatagagc 

2041 agacgatata gtagacaaag acatgtctaa agcgcttgat gcattaatga gtatagattt 

2101 ctaatagagg aggtgagaca tgagtattct agaaaagata tttaaaacta ggaaagatat 

2161 aacatatatg cttgatttag atatgataga agatctatca caacaagcgt atgtgaaacg 

2221 tttagcgatt gatagttgta ttgaatttgt tgcgcgagct gtcgctcaaa gtcattttaa 

2281 agtattggaa ggtaatagaa ttcaaaagaa tgatgtttac tacaagttaa atataaaacc 

2341 aaatactgac ttatcaagcg atagtttttg gcaacaagtt atatataaac taatttatga 

2401 taacgaggtt ttaatcgtag taagtgacag caaagaatta cttatcgcag atagctttta 

2461 cagagaagag tacgctttgt atgatgatat attcaaagat gtaacggtta aagattatac 

2521 ttatcaacgt actttcacaa tgcaagaggt catatattta aagtacaaca acaataaagt 

2581 gacacacttt gtagaaagtc tattcgaaga ttacgggaaa atattcggaa gaatgatagg 

2641 tgcacaatta aaaaactatc aaataagagg gattttgaaa tctgcctcta gcgcatatga 

2701 cgaaaagaat atagaaaaat tacaagcgtt cacaaataaa ttattcaata cttttaataa 
Figure 2B

2761 aaatcaacta gcaatcgcgc ctttgataga aggttttgat tatgaggaat tatctaatgg 

2821 tggtaagaat agtaacatgc ctttttctga attgagtgag ctaatgagag atgcaataaa 

2881 aaatgttgcg ttgatgattg gtatacctcc aggtttgatt tacggagaaa cagctgattt 

2941 ggaaaaaaac acgcttgtat ttgagaagtt ctgtttaaca cctttattaa aaaagattca 

3001 gaacgaatta aacgcgaaac tcataacaca aagcatgtat ttgaaagata caagaataga 

3061 aattgtcggt gtgaataaaa aagacccact tcaatatgct gaagcaattg acaaacttgt 

3121 aagttctggt tcatttacaa ggaatgaggt gcggattatg ttaggtgaag aaccatcaga 

3181 caatcctgaa ttagacgaat acctgattac taaaaactac gaaaaagcta acagtggtga 

3241 aaatgatgaa aaagaaaaag atgaaaacac tttgaaaggt ggtgatgaag atgaaagcgg 

3301 agattaaagg cgtcatcgtt tccaacgaag ataaatgggt ttacgaaatg cttggtatgg 

3361 attcgacttg tcctaaagat gttttaacac aactagaatt tagtgatgaa gatgttgata 

3421 ttataattaa ctcaaatggt ggtaacctag tagctggtag tgaaatatat acacatttaa 

3481 gagctcataa aggcaaagtg aatgttcgta tcacagcaat agcagcaagt gcggcatcgc 

3541 ttatcgcaat ggctggtgac cacatcgaaa tgagtccggt tgctagaatg atgattcaca 

3601 atccttcaag tattgcgcaa ggagaagtga aagatctaaa tcatgctgca gaaacattag 

3661 aacatgttgg tcaaataatg gctgaggcat atgcggttag agctggtaaa aacaaacaag 

3721 aacttataga aatgatggct aaggaaacgt ggctaaatgc tgatgaagcc attgaacaag 

3781 gttttgcgga tagtaaaatg tttgaaaacg acaatatgca aattgtagca agcgatacac 

3841 aagtgttatc gaaagatgta ttaaatcgtg taacagcttt ggtaagtaaa acgccagagg 

3901 ttaacattga tattgacgca atagcaaata aagtaattga aaaaataaat atgaaagaaa

3961 aggaatcaga aatcgatgtt gcagatagta aattatcagc aaatggattt tcaagattcc

4021 ttttttaata caaaaatagg aggtcataaa atgactataa atttatcgga aacattcgca 

4081 aatgcgaaaa acgaatttat taatgcagta aacaacggtg aaccgcaaga aagacaaaat 

4141 gaattgtacg gtgacatgat taaccaacta tttgaagaaa ctaaattaca agcaaaagca 

4201 gaagctgaaa gagtttctag tttacctaaa tcagcacaaa ctttgagtgc aaaccaaaga 

4261 aatttcttta tggatatcaa taagagtgtt ggatataaag aagaaaaact tttaccagaa 

4321 gaaacaattg atagaatctt cgaagattta acaacgaatc atccattatt agctgactta 

4381 ggtattaaaa atgctggttt gcgtttgaag ttcttaaaat ccgaaacttc tggcgtggct 

4441 gtttggggta aaatctatgg tgaaattaaa ggtcaattag atgctgcgtt cagtgaagaa 

4501 acagcaattc aaaataaatt gacagcgttt gttgttttac caaaagattt aaatgatttt 

4561 ggtcctgcgt ggattgaaag atttgttcgt gttcaaatcg aagaagcatt tgcagtggcg 

4621 cttgaaactg cgttcttaaa aggtactggt aaagaccaac cgattggctt aaaccgtcaa 

4681 gtacaaaaag gtgtatcggt aactgatggt gcttatccag agaaagaaga acaaggtacg 

4741 cttacatttg ctaatccgcg cgctacggtt aatgaattga cgcaagtgtt taaataccac 

4801 tcaactaacg agaaaggtaa atcagtagcg gttaaaggta atgtaacaat ggttgttaat 

4861 ccgtccgatg cttttgaggt tcaagcacag tatacacatt taaatgcaaa tggcgtatat 

4921 gttactgctt taccatttaa tttgaatgtt attgagtcta cagttcaaga agcaggtaag 

4981 gttttaacgt acgttaaagg tctatatgat ggttatttag ctggtggtat taatgttcag 

5041 aaatttaaag aaacacttgc gttagatgat atggatttat acactgcaaa acaatttgct 

5101 tacggcaaag cgaaagataa taaagttgct gctgtttgga aattagattt aaaaggacat 

5161 aaaccagctt tagaagatac cgaagaaaca ctataaaatt ttatgaggtg ataaaatggt 

5221 gaaatttaaa gttgttagag aatttaaaga catagagcac aatcaacaca agtacaaagt 

5281 aggggagttg tatccagctg aagggtataa caatcctcgt gttgaattgt tgacaaatca 

5341 aatcaaaaat aagtacgaca aagtttatat cgtaccttta gataagctga caaaacaaga 

5401 attattagaa ctatgcgaat cattacaaaa aaaagcgtct agttcaatgg ttaaaagtga 

5461 aatcatcgac ttattgaatg gtgaagacaa tgacgattga tgatttgctt gtcaaattta 

5521 aatcacttga aaagattgac cataattcag aggatgagta cttaaagcag ttgttaaaaa 

5581 tgtcgtacga gcgtataaaa aatcagtgcg gagtttttga attagagaat ttaataggtc 

5641 aagaattgat acttatacgc gctagatatg cttatcaaga tttattagaa cacttcaacg 

5701 acaattacag acctgaaata atagattttt cgttatctct aatggaggta tcagaagatg 
Figure 2C

5761 aagaaagtgt ttaagaaacc tagaattaca actaaacgtt taaatacgcg tgttcatttt 

5821 tataagtata ctgaaaataa tggtccagaa gctggagaaa aagaagaaaa attattatat 

5881 agctgttggg cgagtattga tggtgtctgg ttacgtgaat tagaacaagc tatctcaaac 

5941 ggaacgcaaa atgacattaa attgtatatt cgtgatccgc aaggtgatta tttacccagt 

6001 gaagaacatt atcttgaaat tgaatcaaga tatttcaaaa atcgtttgaa tataaagcaa 

6061 gtatcaccag atttggataa taaagacttt attatgattc gcggaggata tagttcatga 

6121 gtgtgaaagt gacaggtgat aaagcattag aaagagaatt agaaaaacat tttggcataa 

6181 aagagatggt aaaagttcaa gataaggcgt taatagctgg tgctaaggta attgttgaag 

6241 aaataaaaaa acaactcaaa ccttcagaag actcaggagc actgattagt gagattggtc 

6301 gtactgaacc tgaatggata aaggggaaac gtactgttac aattaggtgg cgtgggcctt 

6361 ttgaacgatt tagaatagta catttaattg aaaatggtca tgttgagaaa aagtcaggaa

6421 aatttgtaaa acctaaagct atgggtggga ttaatagagc aataagacaa gggcaaaata 

6481 agtattttga gacgctaaaa agggagttga aaaaattgtg attgatattt tgtacaaagt 

6541 tcatgaagtg attagtcaag acagaattat tagagagcac gtaaatatca ataatattaa 

6601 gttcaataaa taccctaatg taaaagatac tgatgtacct tttattgtta ttgacgatat 

6661 cgacgaccca atacctacaa cttatactga cggagatgag tgtgcatata gttatattgt 

6721 ccaaatagat gtttttgtta agtacaatga tgaatataat gcgagaatca taagaaataa 

6781 gatatctaat cgcattcaaa agttattatg gtctgaacta aaaatgggaa atgtttcaaa 

6841 tggaaaaccg gaatatatag aagaatttaa aacatataga agctctcgcg tttacgaggg 

6901 cattttttat aaggaggaaa attaaatggc agtaaaacat gcaagtgcgc caaaggcgta 

6961 tattaacatt actggtttag gtttcgctaa attaacgaaa gaaggcgcgg aattaaaata

7021 tagtgatatt acaaaaacaa gaggattaca aaaaattggt gttgaaactg gtggagaact 

7081 aaaaacagct tatgctgatg gcggtccaat tgaatcaggg aatacagacg gagaaggtaa 

7141 aatctcatta caaatgcatg cgttccctaa agagattcgc aaaattgttt ttaatgaaga 

7201 ttatgatgaa gatggcgttt acgaagagaa acaaggtaaa caaaacaatt acgtagctgt 

7261 atggttcaga caagagcgta aagacggtac atttagaaca gttttattac ctaaagttat 

7321 gtttacaaat cctaaaatcg atggagaaac ggctgagaaa gattgggatt tctcaagtga 

7381 agaggttgaa ggtgaggcac ttttcccttt agttgataat aaaaagtcag tacgtaagta 

7441 tatctttgat tcagctaaca tgacaaatca tgatggagac ggtgaaaaag gcgaagaggc 

7501 tttcttaaag aaaattttag gcgaagaata tactggaaac gtgacagagg gtaacgaaga 

7561 aactttgtaa caaaaccggc ttcatcggaa actgcggtaa agtcggttaa tataccagat 

7621 agcattaaaa cacttaaagt tggcgacaca tacgatttaa atgttgtagt agagccatct 

7681 aatcaaagta agttattgaa atacacaaca gatcaaacga atattgtatc aatcaatagt 

7741 gatggtcaag ttactgcgga agcacaaggc attgctacgg ttaaagcaac agttggtaat 

7801 atgagtgaca ctataacaat aaatgtagaa gcataagagg gggcaacccc tctattttat 

7861 ttgaaaataa ggagagtatt ataaaatggc aaaattaaaa cgtaacatta ttcaattagt 

7921 agaagatcca aaagcaaatg aaattaaatt acaaacgtac ttaacaccac acttcatttc 

7981 atttgaaatt gtatacgaag caatggattt aatcgatgat attgaggacg aaaatagcac 

8041 gatgaagcca agagaaatcg ctgacagatt gatggatatg gttgtaaaaa tttacgataa 

8101 ccaattcaca gttaaagacc taaaagaacg tatgcatgca cctgatggaa tgaatgcact 

8161 tcgtgaacaa gtgattttca ttactcaagg tcaacaaact gaggaaacta gaaattttat 

8221 ccagaacatg aaataaagcc tgaagattta acatataaag caatgttgaa aaatatggat 

8281 actctcatga tggacttaat tgaaaatggt aaagacgcta acgaagtttt aaaaatgcca 

8341 tttcattatg tgctttccat atatcaaaat aaaaataatg acatttctga agaaaaagca 

8401 gaggctttaa ttgatgcatt ttaaccttaa ccgtttggtt agggttattt ttttgaactt 

8461 ttttagaaag gaggtaaaaa atgggagaaa gaataaaagg tttatctata ggtttggatt 

8521 tagatgcagc aaatttaaat agatcatttg cagaaatcaa acgaaacttt aaaactttaa 

8581 attctgactt aaaattaaca ggcaacaact tcaaatatac cgaaaaatca actgatagtt 

8641 acaaacaaag gattaaagaa cttgatggaa ctatcacagg ttataagaaa aacgttgatg 

8701 atttagccaa gcaatatgac aaggtatctc aagaacaggg cgaaaacagt gcagaagctc 
Figure 2D

8761 aaaagttacg acaagaatat aacaaacaag caaatgagct gaattattta gaaagagaat 

8821 tacaaaaaac atcagccgaa tttgaagagt tcaaaaaagc tcaagttgaa gctcaaagaa 

8881 tggcagaaag tggctgggga aaaaccagta aagtttttga aagtatggga cctaaattaa 

8941 caaaaatggg tgatggttta aaatccattg gtaaaggttt gatgattggt gtaactgcac 

9001 ctgttttagg tattgcagca gcatcaggaa aagcttttgc agaagttgat aaaggtttag 

9061 atactgttac tcaagcaaca ggcgcaacag gcagtgaatt aaaaaaattg cagaactcat 

9121 ttaaagatgt ttatggcaat tttccagcag atgctgaaac tgttggtgga gttttaggag 

9181 aagttaatac aaggttaggt tttacaggta aagaacttga aaatgccaca gagtcattct 

9241 tgaaattcag tcatataaca ggttctgacg gtgtgcaagc cgtacagtta attacccgtg 

9301 caatgggcga tgcaggtatc gaagcaagtg aatatcaaag tgttttggat atggtagcaa 

9361 aagcggcgca agctagtggg ataagtgttg atacattagc tgatagtatt actaaatacg 

9421 gcgctccaat gagagctatg ggctttgaga tgaaagaatc aattgcttta ttctctcaat 

9481 gggaaaagtc aggcgttaat actgaaatag cattcagtgg tttgaaaaaa gctatatcaa 

9541 attggggtaa agctggtaaa aacccaagag aagaatttaa gaagacatta gcagaaattg 

9601 aaaagacgcc ggatatagct agcgcaacaa gtttagcgat tgaagcattt ggtgcaaagg 

9661 caggtcctga tttagcagac gctattaaag gtggtcgctt tagttatcaa gaatttttaa 

9721 aaactattga agattcccaa ggcacagtaa accaaacatt taaagattct gaaagtggct 

9781 ccgaaagatt taaagtagca atgaataaat taaaattagt aggtgctgat gtatgggctt 

9841 ctattgaaag tgcgtttgct cccgtaatgg aagaattaat caaaaagcta tctatagcgg 

9901 ttgattggtt ttccaattta agtgatggtt ctaaaagatc aattgttatt ttcagtggta 

9961 ttgctgctgc aattggtcct gtagtttttg ggttaggtgc atttataagt acaattggca 

10021 atgcagtaac tgtattagct ccattgttag ctagtattgc aaaggctggt ggattgatta 

10081 gttttttatc gactaaagta cctatattag gaactgtctt cacagcttta actggtccaa 

10141 ttggcattgt attaggtgta ttggctggtt tagcagtcgc atttacaatt gcttataaga 

10201 aatctgaaac atttagaaat tttgttaatg gtgcaattga aagtgttaaa caaacattta 

10261 gtaattttat tcaatttatt caacctttcg ttgattctgt taaaaacatc tttaaacaag 

10321 cgatatcagc aatagttgat ttcgcaaaag atatttggag tcaaatcaat ggattcttta 

10381 atgaaaacgg aatttccatt gttcaagcac ttcaaaatat atgcaacttt attaaagcga 

10441 tatttgaatt tattttaaat tttgtaatta aaccaattat gttcgcgatt tggcaagtga 

10501 tgcaatttat ttggccggcg gttaaagcct tgattgtcag tacttgggag aacataaaag 

10561 gtgtaataca aggtgcttta aatatcatac ttggcttgat taagttcttc tcaagtttat 

10621 tcgttggtga ttggcgagga gtttgggacg ccgttgtgat gattcttaaa ggagcagttc 

10681 aattaatttg gaatttagtt caattatggt ttgtaggtaa aatacttggt gttgttaggt 

10741 actttggcgg gttgctaaaa ggattgatag caggaatttg ggacgtaata agaagtatat 

10801 tcagtaaatc tttatcagca atttggaatg caacaaaaag tatttttgga tttttattta 

10861 atagcgtaaa atcaattttc acaaatatga aaaattggtt atctaatact tggagcagta 

10921 tccgtacgaa tacaatagga aaagcgcagt cattatttag tggcgtcaaa tcaaaattta 

10981 ctaatttatg gaatgcgacg aaagaaattt ttagtaattt aagaaattgg atgtcaaata 

11041 tttggaattc cattaaagat aatacggtag gaattgcaag ccgtttatgg agtaaggtac 

11101 gtggaatttt cacaaatatg cgcgatggct tgagttccat tatagataag attaaaagtc 

11161 atatcggcgg tatggtaagc gctattaaaa aaggacttaa taaattaatc gacggtttaa 

11221 actgggtcgg tggtaagttg ggaatggata aaatacctaa gttacacact ggtacagagc 

11281 acacacatac tactacaaga ttagttaaga acggtaagat tgcacgtgac acattcgcta 

11341 cagttgggga taagggacgc ggaaatggtc caaatggttt tagaaatgaa atgattgaat 

11401 tccctaacgg taaacgtgta atcacaccta atacagatac taccgcttat ttacctaaag 

11461 gctcaaaagt atacaacggt gcacaaactt attcaatgtt aaacggaacg cttccaagat 

11521 ttagtttagg tactatgtgg aaagatatta aatctggtgc atcatcggca tttaactgga 

11581 caaaagataa aataggtaaa ggtaccaaat ggcttggcga taaagttggc gatgttttag 

11641 attttatgga aaatccaggc aaacttttaa attatatact tgaagctttt ggaattgatt 

11701 tcaattcttt aactaaaggt atgggaattg caggcgacat aacaaaagct gcatggtcta 
Figure 2E



11761 agattaagaa aagtgctact gattggataa aagaaaattt agaagctatg ggcggtggcg 

11821 atttagtcgg cggaatatta gaccctgaca aaattaatta tcattatgga cgtaccgcag 

11881 cttataccgc tgcaactgga agaccatttc atgaaggtgt cgattttcca tttgtatatc 

11941 aagaagttag aacgccgatg ggtggcagac ttacaagaat gccatttatg tctggtggtt 

12001 atggtaatta tgtaaaaatt actagtggcg ttatcgatat gctatttgcg catttgaaaa 

12061 actttagcaa atcaccacct agtggcacga tggtaaagcc cggtgatgtt gttggtttaa 

12121 ctggtaatac cggatttagt acaggaccac atttacattt tgaaatgagg agaaatggac 

12181 gacattttga ccctgaacca tatttaagga atgctaagaa aaaaggaaga ttatcaatag 

12241 gtggtggcgg tgctacttct ggaagtggcg caacttatgc cagtcgagta atccgacaag 

12301 cgcaaagtat tttaggtggt cgttataaag gtaaatggat tcatgaccaa atgatgcgcg 

12361 ttgcaaaacg tgaaagtaac taccagtcaa atgcagtgaa taactgggat ataaatgctc 

12421 aaagaggaga cccatcaaga ggattattcc aaatcatcgg ctcaactttt agagcaaacg 

12481 ctaaacgtgg atatactaac tttaataatc cagtacatca aggtatctca gcaatgcagt 

12541 acattgttag acgatatggt tggggtggtt ttaaacgtgc tggtgattac gcatatgcta 

12601 caggtggaaa agtttttgat ggttggtata acttaggtga agacggtcat ccagaatgga 

12661 ttattccaac agatccagct cgtagaaatg atgcaatgaa gattttgcat tatgcagcag 

12721 cagaagtaag agggaaaaaa gcgagtaaaa ataagcgtcc tagccaatta tcagacttaa 

12781 acgggtttga tgatcctagc ttattattga aaatgattga acaacagcaa caacaaatag 

12841 ctttattact gaaaatagca caatctaacg atgtgattgc agataaagat tatcagccga 

12901 ttattgacga atacgctttt gataaaaagg tgaacgcgtc tatagaaaag cgagaaaggc

12961 aagaatcaac aaaagtaaag tttagaaaag gaggaattgc tattcaatga tagacactat

13021 taaagtgaac aacaaaacaa ttccttggtt gtatgtcgaa agagggtttg aaataccctc
13081 ttttaattat gttttaaaaa cagaaaatgt agatggacgt tcggggtcta tatataaagg 
13141 gcgtaggctt gaatcttata gttttgatat acctttggtg gtacgtaatg actatttatc 
13201 tcacaacggc attaaaacac atgatgacgt cttgaatgaa ttagtaaagt tttttaacta 
13261 cgaggaacaa gttaaattac aattcaaatc taaagattgg tactggaacg cttatttcga 
13321 aggaccaata aagctgcaca aagaatttac aatacctgtt aagttcacta tcaaagtagt 
13381 actaacagac ccttacaaat attcagtaac aggaaataaa aatactgcga tttcagacca 
13441 agtttcagtt gtaaatagtg ggactgctga cactccttta attgttgaag cccgagcaat 
13501 taaaccatct agttacttta tgattactaa aaatgatgaa gattatttta tggttggtga 
13561 tgatgaggta accaaagaag ttaaggatta catgcctcct gtttatcata gtgagtttcg 
13621 tgatttcaaa ggttggacta agatgattac tgaagatatt ccaagtaatg acttaggtgg 
13681 taaggtcggc ggtgactttg tgatatccaa tcttggcgaa ggatataaag caactaattt
13741 tcctgatgca aaaggttggg ttggtgctgg cacgaaacga gggctcccta aagcgatgac
13801 agattttcaa attacctata aatgtattgt tgaacaaaaa ggtaaaggtg ccggaagaac
13861 agcacaacat atttatgata gtgatggtaa gttacttgct tctattggtt atgaaaataa 
13921 atatcatgat agaaaaatag gacatattgt tgttacgttg tataaccaaa aaggagaccc
13981 caaaaagata tacgactatc agaataaacc gataatgtat aacttggaca gaatcgttgt
14041 ttatatgcgg ctcagaagag taggtaataa attttctatt aaaacttgga aatttgatca 
14101 cattaaagac ccagatagac gtaaacctat tgatatggat gagaaagagt ggatagatgg 
14161 cggtaagttt tatcagcgtc cagcttctat catagctgtc tatagtgcga agtataacgg 
14221 ttataagtgg atggagatga atgggttagg ttcattcaat acggagattc taccgaaacc 
14281 gaaaggcgca agggatgtca ttatacaaaa aggtgattta gtaaaaatag atatgcaagc 
14341 aaaaagtgtt gtcatcaatg aggaaccaat gttgagcgag aaatcgtttg gaagtaatta 
14401 tttcaatgtt gattctgggt acagtgaatt aatcatacaa cctgaaaacg tctttgatac 
14461 gacggttaaa tggcaagata gatatttata gaaaggagat gagagtgtga tacatgtttt 
14521 agattttaac gacaagatta tagatttcct ttctactgat gacccttcct tagttagagc 
14581 gattcataaa cgtaatgtta atgacaattc agaaatgctt gaactgctca tatcatcaga 
14641 aagagctgaa aagttccgtg aacgacatcg tgttattata agggattcaa acaaacaatg 
14701 gcgtgaattt attattaact gggttcaaga tacgatggac ggctacacag agatagaatg 
Figure 2F

14761 tatagcgtct tatcttgctg atataacaac agctaaaccg tatgcaccag gcaaatttga 

14821 gaaaaagaca acttcagaag cattgaaaga tgtgttgagc gatacaggtt gggaagtttc 

14881 tgaacaaacc gaatacgatg gcttacgtac tacgtcatgg acttcttatc aaactagata 

14941 tgaagtttta aagcaattat gtacaaccta taaaatggtt ttagattttt atattgagct 

15001 tagctctaat accgtcaaag gtagatatgt agtactcaaa aagaaaaaca gcttattcaa 

15061 aggtaaagaa attgaatatg gtaaagattt agtcgggtta actaggaaga ttgatatgtc 

15121 agaaatcaaa acagcattaa ttgctgtggg acctgaaaat gacaaaggga agcgtttaga 

15181 gctagttgtg acagatgacg aagcgcaaag tcaattcaac ctacctatgc gctatatttg 

15241 ggggatatat gaaccacaat cagatgatca aaatatgaat gaaacacgat taagttcttt 

15301 agccaaaaca gagttaaata aacgtaagtc ggcagttatg tcatatgaga ttacttctac 

15361 tgatttggaa gttacgtatc cgcacgagat tatatcaatt ggcgatacag tcagagtaaa 

15421 acatagagat tttaacccgc cattgtatgt agaggcagaa gttattgctg aagaatataa 

15481 cataatttca gaaaatagca catatacatt cggtcaacct aaagagttca aagaatcaga 

15541 attacgagaa gagtttaaca agcgattgaa cataatacat caaaagttaa acgataatat 

15601 tagcaatatc aacactatag ttaaagatgt tgtagatggt gaattagaat actttgaacg 

15661 caaaatacac aaaagtgata caccgccaga aaatccagtc aatgatatgc tttggtatga 

15721 tacaagtaac cctgatgttg ctgtcttgcg tagatattgg aatggtcgat ggattgaagc 

15781 aacaccaaat gatgttgaaa aattaggtgg tataacaaga gagaaagcgc tattcagtga 

15841 attaaacaat atttttatta atttatctat acaacacgct agtcttttgt cagaagctac 

15901 agaattactg aatagcgagt acttagtaga taatgatttg aaagcggact tacaagcaag 

15961 tttagacgct gtgattgatg tttataatca aattaaaaat aatttagaat ctatgacacc 

16021 cgaaactgca acgattggtc ggttggtaga tacacaagct ttatttcttg agtatagaaa 

16081 gaaattacaa gatgtttata cagatgtaga agatgtcaaa atcgccattt cagatagatt 

16141 taaattatta cagtcacaat acactgatga aaaatataaa gaagcgttgg aaataatagc 

16201 aacaaaattt ggtttaacgg tgaatgaaga tttgcagtta gtcggagaac ctaatgttgt 

16261 taaatcagct attgaagcag ctagagaatc cacaaaagaa caattacgtg actatgtaaa 

16321 aacatcggac tataaaacag acaaagacgg tattgttgaa cgtttagata ctgctgaagc 

16381 tgagagaacg actttaaaag gtgaaatcaa agataaagtt acgttaaacg aatatcgaaa 

16441 cggattggaa gaacaaaaac aatatactga tgaccagtta agtgatttgt ccaataatcc 

16501 tgagattaaa gcaagtattg aacaagcaaa tcaagaagcg caagaagctt taaaatcata 

16561 cattgatgct caagatgatc ttaaagagaa ggaatcgcaa gcgtatgctg atggtaaaat 

16621 ttcggaagaa gagcaacgcg ctatacaaga tgctcaagct aaacttgaag aggcaaaaca 

16681 aaacgcagaa ctaaaggcta gaaacgctga aaagaaagct aatgcttata cagacaacaa 

16741 ggtcaaagaa agcacagatg cacagaggaa aacattgact cgctatggtt ctcaaattat 

16801 acaaaatggt aaggaaatca aattaagaac tactaaagaa gagtttaatg caaccaatcg 

16861 tacactttca aatatattaa acgagattgt tcaaaatgtt acagatggaa caacaatcag 

16921 atatgatgat aacggagtgg ctcaagcttt gaatgtgggg ccacgtggta ttagattaaa 

16981 tgctgataaa attgatatta acggtaatag agaaataaac cttcttatcc aaaatatgcg 

17041 agataaagta gataaaaccg atattgtcaa cagtcttaat ttatcaagag agggtcttga 

17101 tatcaatgtt aatagaattg gaattaaagg cggtgacaat aacagatatg ttcaaataca 

17161 gaatgattct attgaactag gtggtattgt gcaacgtact tggagaggga aacgttcaac 

17221 agacgatatt tttacgcgac tgaaagacgg tcacctaaga tttagaaata acaccgctgg 

17281 cggttcactt tatatgtcac attttggtat ttcgacttat attgatggtg aaggtgaaga 

17341 cggtggttca tctggtacga ttcaatggtg ggataaaact tacagtgata gtggcatgaa 

17401 tggtataaca atcaattcct atggtggtgt cgttgcacta acgtcagata ataatcgggt 

17461 tgttctggag tcttacgctt catcgaatat caaaagcaaa caggcaccgg tgtatttata 

17521 tccaaacaca gacaaagtgc ctggattaaa ccgatttgca ttcacgctgt ctaatgcaga 

17581 taatgcttat tcgagtgacg gttatattat gtttggttct gatgagaact atgattacgg 

17641 tgcgggtatc aggttttcta aagaaagaaa taaaggtctt gttcaaattg ttaatggacg 

17701 atatgcaaca ggtggagata caacaatcga agcagggtat ggcaaattta atatgctgaa 
Figure 2G

17761 acgacgtgat ggtaataggt atattcatat acagagtaca gacctactgt ctgtaggttc 

17821 agatgatgca ggagatagga tagcttctaa ctcaatttat agacgtactt attcggccgc 

17881 agctaatttg catattactt ctgctggcac aattgggcgt tcgacatcag cgcgtaaata 

17941 caagttatct atcgaaaatc aatataacga tagagatgaa caactggaac attcaaaagc 

18001 tattcttaac ttacctatta gaacgtggtt tgataaagct gagtctgaaa ttttagctag 

18061 agagctgaga gaagatagaa aattatcgga agacacctat aaacttgata gatacgtagg 

18121 tttgattgct gaagaggtgg agaatttagg attaaaagag tttgtcacgt atgatgacaa 

18181 aggagaaatt gaaggtatag cgtatgatcg tctatggatt catcttatcc ctgttatcaa 

18241 agaacaacaa ctaagaatca agaaattgga ggagtcaaag aatgcaggat aacaaacaag 

18301 gattacaagc taatcctgaa tatacaattc attatttatc acaggaaatt atgaggttaa 

18361 cacaagaaaa cgcgatgtta aaagcgtata tacaagaaaa taaagaaaat caacaatgtg 

18421 ctgaggaaga gtaatcctta gcactatttt tatacaaaaa tttaaggagg tcatttaatt 

18481 atggcaaaag aaattatcaa caatacagaa aggtttattt tagtacaaat cgacaaagaa 

18541 ggtacagaac gtgtagtata tcaagatttc acaggaagtt ttacaacttc tgaaatggtt 

18601 aaccatgctc aagattttaa atctgaagaa aacgctaaga aaattgcgga gacgttaaat 

18661 ttgttatatc aattaactaa caaaaaacaa cgtgtgaaag tagttaaaga agtagttgaa 

18721 agatcagatt tatctccaga ggtaacagtt aacactgaaa cagtatgaaa agctatgagt 

18781 tagatactca tagtctttat tcttttagaa agcgggtgta ctgaattggg gtggttcaaa

18841 aaacacgaac atgaatggcg catcagaagg ttagaagaga atgataaaac aatgctcagc 

18901 acactcaacg aaattaaatt aggtcaaaaa acccaagagc aagttaacat taaattagat 

18961 aaaaccttag atgctattca aaaagaaaga gaaatagatg aaaagaataa gaaagaaaat 

19021 gataagaaca tacgtgatat gaaaatgtgg gtgcttggtt tagttgggac aatatttggg 

19081 tcgctaatta tagcattatt gcgtatgctt atgggcatat aagagaggtg attaccatgt 

19141 tcggattaaa ttttggagct tcgctgtgga cgtgtttctg gtttggtaag tgtaagtaat 

19201 agttaagagt cagtgcttcg gcactggctt tttattttgg ataaaaggag caaacaaatg 

19261 gatgcaaaag taataacaag atacatcgta ttgatcttag cattagtaaa tcaattctta 

19321 gcgaacaaag gtattagccc aattccagta gacgatgaaa ctatatcatc aataatactt 

19381 actgtagtcg ctttatatac aacgtataaa gacaatccaa catctcaaga aggtaaatgg 

19441 gcaaatcaaa aattaaagaa atataaagct gaaaataagt atagaaaagc aacagggcaa 

19501 gcgccaatta aagaagtaat gacacctacg aatatgaacg acacaaatga tttagggtag 

19561 gtggttgata tatgttaatg acaaaaaatc aagcagaaaa atggtttgac aattcattag 

19621 ggaaacaatt caacccagat ggttggtatg gatttcagtg ttatgattac gccaatatgt 

19681 tctttatgtt agcgacaggc gaaaggctgc aaggtttata tgcttataat atcccgtttg 

19741 ataataaagc aaagattgaa aaatatggtc aaataattaa aaactatgac agctttttac 

19801 cgcaaaagtt ggatattgtc gttttcccgt caaagtatgg tggcggagct ggacacgttg 

19861 aaattgttga gagcgcaaat ttaaatactt tcacatcatt tggtcaaaac tggaacggta 

19921 aaggttggac taatggcgtt gcgcaacctg gttggggtcc tgaaactgtg acaagacatg

19981 ttcattatta tgacaatcca atgtatttta ttaggttaaa cttccctaac aacttaagcg 

20041 ttggcaataa agctaaaggt attattaagc aagcgactac aaaaaaagag gcagtaatta 

20101 aacctaaaaa aattatgctt gtagccggtc atggttataa cgatcctgga gcagtaggaa 

20161 acggaacaaa cgaacgcgat tttatacgta aatatataac gcctaatatc gctaagtatt 

20221 taagacatgc aggacatgaa gttgcattat acggtggctc aagtcaatca caagatatgt 

20281 atcaagatac tgcatacggt gttaatgtag gcaataaaaa agattatggc ttatattggg 

20341 ttaaatcaca ggggtatgac attgttctag aaatacattt agacgcagca ggagaaagcg 

20401 caagtggtgg gcatgttatt atctcaagtc aattcaatgc agatactatt gataaaagta 

20461 tacaagatgt tattaaaaat aacttaggac aaataagagg tgtgacacct cgtaatgatt 

20521 tactaaatgt taatgtatca gcagaaataa atataaatta tcgtttatct gaattaggtt

20581 ttattactaa taaaaatgat atggattgga ttaagaaaaa ctatgacttg tattctaaat 

20641 taatagccgg tgcgattcat ggtaagccta taggtggttt ggtagctggt aatgttaaaa 

20701 catcagctaa aaacaaaaaa aatccaccag tgccagcagg ttatacactc gataagaata 
Figure 2H



20761 atgtccctta taaaaaagaa caaggcaatt acacagtagc taatgttaaa ggtaataatg 

20821 taagagacgg ttattcaact aattcaagaa ttacaggggt attacccaac aacacaacaa 

20881 ttacgtatga cggtgcatat tgtattaatg gttatagatg gattacttat attgctaata 

20941 gtggacaacg tcgttatata gcgacaggag aggtagacaa ggcaggtaat agaataagta 

21001 gttttggtaa gtttagcacg atttagtatt tacttagaat aaaaattttg ctacattaat 

21061 tatagggaat cttacagtta ttaaataact atttggatgg atgttaatat tcctatacac 

21121 tttttaacat ttctctcaag atttaaatgt agataacagg caggtacttc ggtacttgcc 

21181 tattttttta tgttatagct agccttcggg ctagtttttt gttatgatgt gttacacatg 

21241 catcaactat ttacatctat ccttgttcac ccaagcatgt cactggatgt tttttcttgc 

21301 gatagagagc atagttttca tactactccc cgtagtatat atgactttag cattcccgta

21361 taacagttta cggggtgctt ttatgttata attgctttta tatagtagga gtgaactata 

21421 tagccgggca gaggccatgt atctgactgt tggtcccaca ggagacatct tccttgtcat 

21481 cactcgatac atatatctta acaacataga aatgttacat tcgctataac cgtatcttaa 

21541 tcgatacggt tatatttatt cccctacaac caacaaaacc acagatccta ttaatttagg 

21601 attgtggtta ttttttgcgt ttttttgggg caaaaaaagg gcagattatt tgaaaaaggg 

21661 caaacgcttg tggaaaagct aaaaggttaa aaatgacaaa aaccttgata caacagtgtt 

21721 tttggacgct cgtgtacgtt agagaatgac cggtttacca tcatacaagg gtgggattaa 

21781 cttgtgttaa aaagccttta atatcagttg ttacaaagga tttgtagcgt ctttaaaaat 

21841 aaaaaagggc agaaaaaggg cagatacctt ttagtacaca agtttttcta atttttgctc 

21901 taactctctg tccattttct ctgttacatg tgtatacacc tttatagtcg ttttttcatc 

21961 tgtatgtcct actcttttca taattgcttt taacgatata ttcatttccg ccaataaact 

22021 tatgtgtgta tgccttagtg tgtgagtagt aactttttta tttatattta atgattctgc 

22081 agctgaggac aatcgtttgt ttatcctact gccttgcata ggatttcctt ggcaagttgt 

22141 gaatataaac cctctatcaa catagcttgg ttcccattgt tgcatctttt tattttctaa 

22201 cattattttt ttcaatacat ttgctatcct tgaattgatg gcgatttttc ttcttgaacc 

22261 tgcggtctta gtagtatctt tgtgaccaaa tccagcatta catttgattc tgtgaatagt

22321 gccattaata gcgatcgttt tatttttgag gtcaacatct ttaacttgga gagctaataa 

22381 ctcacctatg cgcatacctg ttaaagcttg aacttctaca gccccagcaa ctaaaatacg 

22441 agctctatac tgcatgttat tatcgttcag tataaaatcg cgtatctgta ttacctgttc 

22501 catctctaaa tagttataca ttttcgcttc ttctttttct atatcttcta tcgtcttact 

22561 cttctttggt agtgtgacgc tatttaatat gtgttcgttt ggataattgt aaaatttaac 

22621 ggcgtattta atagcttctt tcatatgtcc aagttgacgc tttacctgat ttgcagaata 

22681 tacgtttgat aatttgttaa taaatgtttg catgtacttt gtatcaattt tgtttaaaag 

22741 taaattttga gaactgttct ttttgatgtt tttgattctt gttttcaaat tatcaagcgt 

22801 cgttacttta aagccagatg tttttatatg atattcaagc cattcatcta ataacgcgtg 

22861 aaaagtcaaa gtttttaatt cgcttgacga cttgttgttt agtttttctt ttattttttc 

22921 ttctaaacga aacattgcct ctttttgcga ttgctttgta ttcttattca agacaacact 

22981 tacacgtttc catttatctg tatacggatc tttgtatttc tcgtagtatc tatacttcgt 

23041 ttcattgttc ttatttttaa atttttcaaa ccacatttta catccctcct caaaattggc 

23101 aaaaaataat aagggtaggc gggctaccca tgaaaattgt ataaaaaaag acgcctgtat 

23161 aaaatacaga cgccacttat aattataaga ttacatggtt aattaccaaa aatggtaacg 

23221 aatatatacg tgttttaaag gataaacctt taatatatta aaattatatc atcttatatc 

23281 agggatctgc aatatattat tattaattct atttatcagt aacataatat ccgaagaatc 

23341 tattactgga tttttaattt tttggggtaa aacttttctt atgcgaaact tactaatcgg 

23401 ctggaaagaa tttatgcaag cgtaactatt accttttaat ttttttacct tatcaattgc 

23461 tgatactatg ttattaatgt ttctgtcaat tttatttaat ttattttcaa tttctaaact 

23521 atcagatata aattcaataa aataatcttt agtgatgaat tctgtgttgt ttttttggta

23581 ttttttatcg aaaacttctt ttaatatagc tgaattattt tgcgcgctaa ttaaatttaa 

23641 aaacaatctt aaataatact cccatttcaa atcaaaattc atctttaaat actttttgtt 

23701 ttctttagag gataagggaa taacatttac tatatcctcc gtattagaat catttttatt 
Figure 2I



23761 catcactatt gcaaagtgtg aattagaaaa ttctttatta acgtttatac cgaaatctac 
23821 aaaaactatt tctccttgtt taaactttgg ataaaaacct ttatggtttt tttcaccttc 
23881 aaatctcttg agtaaatagt gaatatctga atctaacttt ttaaattttg gatttccaga 
23941 agtttttaat ttattaatgc gtttttctat attatgcgtc atcatttctc ctttattctc 
24001 gctcacactc tcaccaccat tcaacgtcta cacttgtagg cgttttttga ttagtaaaat 
24061 cataatgaat cttctttggt taacttatcg ccatctattt tttgtgaaat aaattccaag 
24121 tatttacgcg cattatgtga cgataaatct ttaggtaact cataagtgaa tggttgatta 
24181 ccactagtta aaacttcata tactatagtt tcttttttta ttttgcaatt agttattttc 
24241 attataaact ccttttaaac actgctgaaa tagacgtctt tttcaaataa gcatgattaa 
24301 tactttaatt ctttaatcca catatattta aaagtgaggt agtaggtaat aaatataaga 
24361 cttaaagtta agattgcttt tttcatgtca atttctcctt tgtttatatt tatattaaag 
24421 cgctaaatat acgttattaa tcacaataca actttgccca ttactttaat atcactaaac 
24481 gaagcgactt tgatatcatc atacttcgga tttagagata ccaaattaat atagtcttcg 
24541 catatatcta cacgcttgat aagacttact ccatctaata caacgagtgc aattgtacca 
24601 tctttaatag aatcttcttt cttaataaaa gcgtatgttc cttgttttaa cataggttcc 
24661 attgaatcac cattaactaa aatacaaaaa tcagcatttg atggcgtttc gtcttcttta 
24721 aaaaatactt cttcatgcaa tatgtcatca tataattctt ctcctatgcc agcaccagtt 
24781 gcaccacatg caatatacga tactagttta gactctttat attcatctat agaagtgact 
24841 ttattctgtt catctaattg ctcatttgca tagttaagta cgttttcttg gcggggaggt 
24901 gtgagttgag aaaatatgtt attgattttt gacattatcg tttcatcttg acgttcttcg 
24961 tcaggaactc gataagaatc tacatcatac cccataagcc acgcttcacc gacatttaaa 
25021 gttttagata ataagaataa tttatgttgg tctggagaag accttccatt aacatactgg 
25081 gataagtgac tttttgacat tttaatattc aattcttttt gaaagggttt cgacttttct 
25141 agaatatcta cttgacgcaa gttcctatct ttcataattt gttttaatct ttcagaagtg 
25201 ttttgcattg gtaatgcctc cttgaaattc attatatagg aagggaaata aaaatcaata 
25261 caaaagttca acttttttaa ctttttgtgt tgacattgtt caaaattggg gttatagtta 
25321 ttatagttca aatgtttgaa cttaggaggt gattatttga atactaatac aacttttgat 
25381 ttttcgttat tgaacggtaa gatagtcgaa gtgtactcga cacaatttaa ctttgctata 
25441 gctttaggtg tatcagaaag aactttgtct ttgaagttga acaacaaagt accatggaaa 
25501 acaacagaca ttattaaagc ttgtaagtta ttgggaatac ctataaaaga tgttcacaaa 
25561 tattttttta aacagaaagt tcaaatgttt gaacttaata agtaaaggag gcataacaca 
25621 tgcaagaacg agaaaaggtt aataaaagta acacatcttc aaatgaagca tcaaaacctt 
25681 ttaggacaaa ttgaagctta cgacaaaacg cttaaagaaa taaagtacac tcgagacctt 
25741 tacaacaaac acctaagcat gaacaacgaa gacgcattcg ctggtttgga aatggtagag 
25801 gatgaaatta ctaaaaagct acgaagtgct atcaaagagt tccaaaaagt agtgaaagcg 
25861 ttagacaagc ttaacggtgt tgaaagcgat aacaaagtta ctgatttaac agagtggcgg 
25921 aaagtgaatc agtaacattc acttcttaat ataaccacgc ttatcaacat ccacattgag 
25981 cagatgtgag cgagagctgg cgatgatatg agccgcgttt aaatacattc gatagtcatt 
26041 gcgataaccg tctgctgaat gtgggtgttg aggaaaaagg aggatactca aatgcaagca 
26101 ttacaaacat ttaattttaa agagctacca gtaagaacag tagaaattga aaacgaacct 
26161 tattttgtag gaaaagatat tgctgagatt ttaggatatg caagatcaaa caatgccatt 
26221 agaaatcatg ttgatagcga ggacaagctg acgcaccaat ttagtgcatc aggtcaaaac 
26281 agaaatatga tcattatcaa cgaatcagga ttatacagtc taatcttcga tgcttctaaa 
26341 caaagcaaaa acgaaaaaat tagagaaacc gctagaaaat tcaaacgctg ggtaacatca 
26401 gatgtcctac cagctattcg caaacacggt atatacgcaa cagacaatgt aattgaacaa 
26461 acattaaaag atccagacta catcattaca gtgttgactg agtataagaa agaaaaagag 
26521 caaaacttac ttttacaaca gcaagtagaa gttaacaaac caaaagtatt attcgctgac 
26581 tcggtagctg gtagtgataa ttcaatactt gttggagaac tagcgaaaat acttaaacaa 
26641 aacggtgttg atataggaca aaacagattg ttcaaatggt taagaaataa tggatatctc 
26701 attaaaaaga gtggagaaag ttataactta ccaactcaaa agagtatgga tctaaaaatc 
Figure 2J 



26761 ttggatatca aaaaacgaat aattaataat ccagatggtt caagtaaagt atcacgtaca 
26821 ccaaaagtaa caggcaaagg acaacaatac tttgttaata agtttttagg agaaaaacaa 
26881 acatcttaaa aggaggaaca caatggaaca aatcacatta accaaagaag agttgaaaga 
26941 aattatagca aaagaagtta gagaggctat aaatggcaag aaaccaatca gttcaggttc 
27001 aattttcagt aaagtaagaa tcaataatga cgatttagaa gaaatcaata aaaaactcaa 
27061 tttcgcaaaa gatttgtcgc taggaagatt gaggaagctc aatcatccga ttccgctaaa 
27121 aaagtatcag catggcttcg aatcaattca tcaaaaagct tatgtacaag atgttcatga 
27181 ccatattaga aaattaacat tatcaatttt tggagtgaca cttaattcag acttgagtga 
27241 aagtgaatac aacctagcag caaaagttta tcgagaaatc aaaaactatt atttatacat 
27301 ctatgaaaag agagtttcag aattaactat cgatgatttc gaataaagga ggaacaacaa 
27361 atgttacaaa aatttagaat tgcgaaagaa aaaaataaat taaaactcaa attactcaag 
27421 catgctagtt actgtttaga aagaaacaac aaccctgaac tgttgcgagc agttgcagag 
27481 ttgttgaaaa aggttagcta aattcaacgg taaggatttg ccctgcctcc acacttagag 
27541 tttgagatcc aacaaacaca taagttttag tagggtctag aaaaaatgtt tcgatttcct 
27601 cttttgtaac agtttcaatt ccttcatatc ctggaaaaac aattttcttt aaatccgaaa 
27661 catgtttttt tgaaccatcc tttaaagtaa ctagaagttt catacttatc acctccttag 
27721 gttgataaca acattataca cgaaaggagc ataaacaata tgcaagcatt acaaacaaat 
27781 tcgaacatcg gagaaatgtt caatattcaa gaaaaagaaa atggagaaat cgcaatcagc 
27841 ggtcgagaac ttcatcaagc attagaagtt aagacagcat ataaagattg gtttccaaga 
27901 atgcttaaat acggatttga agaaaataca gattacacag ctatcgctca aaaaagagca 
27961 acagctcaag gcaatatgac tcactatatt gaccacgcac tcacactaga cactgcaaaa 
28021 gaaatcgcaa tgattcaacg tagtgaacct ggcaaacgtg caagacaata tttcatccaa 
28081 gttgaaaaag catggaacag cccagaaatg attatgcaac gtgctttaaa aattgctaac 
28141 aacacaatca atcaattaga aacaaagatt gcacgtgaca aaccaaaaat tgtatttgca 
28201 gatgcagtag ctactactaa gacatcaatt ttagttggag agttagcaaa gatcattaaa 
28261 caaaacggta taaacatcgg gcaacgcaga ttgtttgagt ggttacgtca aaacggattc 
28321 cttattaaac gcaagggtgt ggattataac atgcctacac agtattcaat ggaacgtgag 
28381 ttattcgaaa ttaaagaaac atcaatcaca cattcggacg gtcacacatc aattagtaag 
28441 acgccaaaag taacaggtaa aggacaacaa tactttgtta acaagttttt aggagaaaaa 
28501 caaacaactt aataggagga attacaaatg aacgcactat acaaaacaac cctcctcatc 
28561 acaatggcag ttgtgacgtg gaaggtttgg aagattgaga agcacactag aaaacctgtg 
28621 attagtagca gggcgttgag tgactatcta aacaacaaat ctttaaccat accgaaagat 
28681 gctgaaaatt ctactgaatc tgctcgtcgc cttttgaagt tcgccgaaca aactattagc 
28741 aaataacaac attatacacg aaaggaaaga tagaaatgcc aaaaatcata gtaccaccaa 
28801 caccagaaaa cacatataga ggcgaagaaa aatttgtgaa aaagttatac gcaacaccta 
28861 cacaaatcca tcaattgttt ggagtatgta gaagtacagt atacaactgg ttgaaatatt 
28921 accgcaaaga taatttaggt gtagaaaatt tatacattga ttattcacca acaggcactc 
28981 tgattaatat ttctaaattg gaagagtatt tgatcagaaa gcataaaaaa tggtattagg 
29041 aggatattaa atgagcaaca tttataaaag ctacctagta gcagtattat gcttcacagt 
29101 cttagcgatt gtacttatgc cgtttctata cttcactaca gcatggtcaa ttgcgggatt 
29161 cgcaagtatc gcaacattca tgtactacaa agaatgcttt ttcaaagaat aaaaaaactg 
29221 ctacttgttg gagcaagtaa cagtatcaaa cacttaagaa aaaattcatg ttcaatataa 
29281 aacgaaaaac ggaggaagtc aagatgtatt acgaaatagg cgaaatcata cgcaaaaata 
29341 ttcatgttaa cggattcgat tttaagctat tcattttaaa aggtcatatg ggcatatcaa 
29401 tacaagttaa agatatgaac aacgtaccaa ttaaacatgc ttatgtcgta gatgagaatg 
29461 acttagatat ggcatcagac ttatttaacc aagcaataga tgaatggatt gaagagaaca 
29521 cagacgaaca ggacagacta attaacttag tcatgaaatg gtaggaggtc gctatgaagc 
29581 agactgtaac ttatatcatt cgtcataggg atatgccaat ttatataact aacaaaccaa 
29641 ctgataacaa ttcagatatt agttactcca caaatagaaa tagagctagg gagtttaacg 
29701 gtatggaaga agcgagtatc aatatggatt atcacaaagc aatcaagaaa acagtgacag 



Jerry PELLETIER et al. 
Compositions and Methods.. 
USSN 09/689,852 



Figure 2K 



29761 aaactattga gtacgaggag gtagaacatg actgaggaaa aacaagaacc acaagaaaaa 

29821 gtaagcatac tcaaaaaact aaagataaat aatatcgctg agaaaaataa aaggaaattc 

29881 tataaatttg cagtatacgg aaaaattggc tcaggaaaaa ccacgtttgc tacaagagat 

29941 aaagacgctt tcgtcattga cattaacgaa ggtggaacaa cggttactga cgaaggatca 
30001 gacgtagaaa tcgagaacta tcaacacttt gtttatgttg taaatttttt acctcaaatt 
30061 ttacaggaga tgagagaaaa cggacaagaa atcaatgttg tagttattga aactattcaa 

30121 aaacttagag atatgacatt gaatgatgtg atgaaaaata agtctaaaaa accaacgttt 
30181 aatgattggg gagaagttgc tgaacgaatt gtcagtatgt acagattaat aggaaaactt 
30241 caagaagaat acaaattcca ctttgttatt acaggtcatg aaggtatcaa caaagataaa 
30301 gatgatgaag gtagcactat caaccctact atcactattg aagcgcaaga acaaattaaa 
30361 aaagctatta cttctcaaag tgatgtgtta gctagggcaa tgattgaaga atttgatgat 
30421 aacggagaaa agaaagctag atatattcta aacgctgaac cttctaatac gtttgaaaca 
30481 aagattagac attcaccttc aataacaatt aacaataaga aatttgcaaa tcctagcatt 
30541 acggacgtag tagaagcaat tagaaatgga aactaaaaat taattaaaag gacggtattt 
30601 aattatgaaa atcacaggac aagcgcaatt tactaaagaa acaaatcaag aaaagtttta 
30661 taacggctca gcagggtttc aagctggaga attcacagtg aaagttaaaa atattgaatt 
30721 caatgataga gaaaatagat atttcacaat cgtatttgaa aatgatgaag gcaaacaata 
30781 taaacataat caatttgtac cgccgtataa atatgatttc caagaaaaac aattgattga 
30841 attagttact cgattaggta ttaagttaaa tcttcctagc ttagattttg ataccaatga 
30901 tcttattggt aagttttgtc acttggtatt gaaatggaaa ttcaatgaag atgaaggtaa 
30961 gtattttacg gatttttcat ttattaaacc ttacaaaaag ggcgatgatg ttgttaacaa 
31021 acctattccg aagacagata agcaaaaagc tgaagaaaat aacggggcac aacaacaaac 
31081 atcaatgtct caacaaagca atccatttga aagcagtggc caatttggat atgacgacca 
31141 agatttagcg ttttaaggtg tggtttaaat gcaatacatt acaagatacc agaaagataa 

31201 cgacggtact tattccgtcg ttgctactgg tgttgaactt gaacaaagtc acattgactt 
31261 actagaaaac ggatatccac taaaagcaga agtagaggtt ccggacaata aaaaactatc 
31321 tatagaacaa cgcaaaaaaa tattcgcaat gtgtagagat atagaacttc actggggcga 

31381 accagtagaa tcaactagaa aattattaca aacagaattg gaaattatga aaggttatga 
31441 agaaatcagt ctgcgcgact gttctatgaa agttgcaagg gagttaatag aactgattat 
31501 agcgtttatg tttcatcatc aaatacctat gagtgtagaa acgagtaagt tgttaagcga 
31561 agataaagcg ttattatatt gggctacaat caaccgcaac tgtgtaatat gcggaaagcc 
31621 tcacgcagac ctggcacatt atgaagcagt cggcagaggc atgaacagaa acaaaatgaa 
31681 ccactatgac aaacatgtat tagcgttatg tcgcgaacat cacaacgagc aacatgcgat 
31741 tggcgttaag tcgtttgatg ataaatacca cttgcatgac tcgtggataa aagttgatga 
31801 gaggctcaat aaaatgttga aaggagagaa aaaggaatga atagactaag aataataaaa 
31861 atagcactcc taatcgtcat cttggcggaa gagattagaa atgctatgca tgctgtaaaa 
31921 gtggagaaaa ttttaaaatc tccgtttagt taatacaggt ttttacaaaa gctttaccat 
31981 aggcggacaa actaattgag ccttttttga tgtctattac ccaggggctg taatgtaact 
32041 ttaatacttc aaattcaatg ccagaaagtt tacttattgt ttctaggttg tgtcctgact 
32101 ttaacattct tttaacaaat tctaatcccg aaacaaatct ttgtttttct ataatcttat 
32161 taaagtgatt taaaaactga ggagcataaa acttattata aattcctttt tttgttaagt 
32221 aagacatgtc aaaagtttca tttaaaaccc ctaaccttac taggttatta attgaaattt 
32281 cggttgattc tatatctaac ggagagtctt ttattaacgt gtccgatata ttcataccgt 
32341 cattctttgg gtttaaaacc gctctatatt taacggcagg atgtacttcg tgattcttta 
32401 aatgttttaa aagaatagca tcatttgggg ataattgttt aattatttca acaaatgaat 
32461 ggtgggttaa tgagtttttt ctgtcatcca tagatgatgc tattagtttt gcgaacatat 
32521 tacttaaagt tttttcacta atgtaaaact ttgaagcttc tagagcagga cctagaagag 
32581 aaaattgtgg ttcttgtaaa ttatttttag gtacagaaga tatttctttt ttaaattgtt 
32641 ctttgaattt ttcaaattct acttctcttt gataaataac tttatccaca taaaggtgga 
32701 atttcccaaa gacaagttcc caagttttag agaatgtttc tacaggccct tttgatgcgc 
Figure 2L 



32761 cttcaataat tttatcaata cctttaccta aaataggatc cataattatt cacccccaat 

32821 ctaacgcaat agcgataata aaattatacc agaaaggaga atcaacatga ctgaccaacc 

32881 aagttactac tcaataatta cagcaaatgt cagatacgat aaccgactta ctgacagcga 

32941 aaagttactt tttgcagaaa taacatcttt aagtaacaaa tacggatact gcacagcaag 

33001 taatggttac tttgcaactt tatacaacgt tgttaaggaa actatatctc gtagaatttc 

33061 gaaccttacc aactttggtt atctaaaaat cgaaattatc aaagaaggta atgaagttaa 

33121 acaaaggaag atgtacccct tgacgcaaac gtcaatacct attgacgcaa aaatcaatac 

33181 ccctattgat aattctgtca atacccctat tgacgcaaat gtcaaagaga atattacaag 

33241 tattaataat acaagtaata acaatataaa tagaatagat atattgtcgg gcaacccgac 

33301 agcatcttct ataccctata aagaaattat cgattactta aacaaaaaag cgggcaagca 

33361 ttttaaacac aatacagcta aaacaaaaga ttttattaaa gcaagatgga atcaagattt 

33421 taggttggag gattttaaaa aggtgattga tatcaaaaca gctgagtggc taaacacgga 

33481 tagcgataaa taccttagac cagaaacact ttttggcagt aaatttgagg ggtacctcaa 

33541 tcaaaaaata caaccaactg gcacggatca attggaacgc atgaagtacg acgaaagtta 

33601 ttgggattag ggggatatta tgaaaccact attcagcgaa aagataaacg aaagcttgaa 

33661 aaaatatcaa cctactcatg tcgaaaaagg attgaaatgt gagagatgtg gaagtgaata 

33721 cgacttatat aagtttgctc ctactaaaaa acacccgaat ggttacgagt ataaagacgg 

33781 ttgcaaatgt gaaatctatg aggaatataa gcgaaacaag caacggaaga taaacaacat 

33841 attcaatcaa tcaaacgtta atccgtcttt aagagatgca acagtcaaaa actacaagcc 

33901 acaaaatgaa aaacaagtac acgctaaaca aacagcaata gagtacgtac aaggcttctc 

33961 tacaaaagaa ccaaaatcat taatattgca aggttcatac ggaactggta aaagccacct 

34021 agcatacgct atcgcaaaag cagtcaaagc taaagggcat acggttgctt ttatgcacat 

34081 accaatgttg atggatcgta tcaaagcgac atacaacaaa aatgcagtag agactacaga 

34141 cgagctagtc agattgctaa gtgatattga tttacttgta ctagatgata tgggtgtaga 

34201 aaacacagag cacactttaa ataaactttt cagcattgtt gataacagag taggtaaaaa 

34261 caacatcttt acaactaact ttagtgataa agaactaaat caaaatatga actggcaacg 

34321 tataaattcg agaatgaaaa aaagagcaag aaaagtaaga gtaatcggag acgatttcag 

34381 ggagcgagat gcatggtaac caaagaattt ttaaaaacta aacttgagtg ttcagatatg 

34441 tacgctcaga aactcataga tgaggcacag ggcgatgaaa ataggttgta cgacctattt 

34501 atccaaaaac ttgcagaacg tcatacacgc cccgctatcg tcgaatatta aggagtgtta 

34561 aaaatgccga aagaaaaata ttacttatac cgagaagatg gcacagaaga tattaaggtc 

34621 atcaagtata aagacaacgt aaatgaggtt tattcgctca caggagccca tttcagcgac 

34681 gaaaagaaaa ttatgactga tagtgaccta aaacgattca aaggcgctca cgggcttcta 

34741 tatgagcaag aattaggttt acaagcaacg atatttgata tttagaggtg gacgatgagt 

34801 aaatacaacg ctaagaaagt tgagtacaaa ggaattgtat ttgatagcaa agtagagtgt 

34861 gaatattacc aatatttaga aagtaatatg aatggcacta attatgatca tatcgaaata 

34921 caaccgaaat tcgaattatt accaaaacta gataaacaac gaaagattga atatattgca 

34981 gacttcgcgt tatatctcga tggcaaactg attgaagtta tcgacattaa aggtatgcca 

35041 accgaagtag caaaacttaa agctaagatt ttcagacata aatacagaaa cataaaactc 

35101 aattggatat gtaaagcgcc taagtataca ggtaaaacat ggattacgta cgaggaatta 

35161 attaaagcaa gacgagaacg caaaagagaa atgaagtgat ctaatgcaac aacaagcata 

35221 tataaatgca acgattgata taaggatacc tacagaagtt gaatatcagc attttgatga 

35281 tgtggataaa gaaaaagaag cgctggcaga ttacttatat aacaatcctg acgaaatact 

35341 agagtatgac aatttaaaaa ttagaaacgt aaatgtagag gtggaataaa tgggcagtgt 

35401 tgtaatcatt aataataaac catataaatt taacaatttt gaaaaaagaa ataatggcaa 

35461 agcgtgggat aaatgctgga attgtttcta aacgtgttag aggttgttgg gagttttcag 

35521 aagctttaga cgcgccttat ggcatgcacc taaaagaata tagagaaatg aaacaaatgg 

35581 aaaagattaa acaagcgaga ctcgaacgtg aattggaaag agagcgaaag aaagaggctg 

35641 agctacgtaa gaagaagcca catttgttta atgtacctca aaaacattca cgtgatccgt 

35701 actggttcga tgtcacttat aaccaaatgt tcaagaaatg gagtgaagca taatgagcat 
Figure 2M 



35761 aatcagtaac agaaaagtag atatgaacaa aacgcaagac aacgttaagc aacctgcgca 
35821 ttacacatac ggcgacattg aaattataga ttttattgaa caagttacgg cacagtaccc 
35881 accacaatta gcattcgcaa taggtaatgc aattaaatac ttgtctagag caccgttaaa 
35941 gaatggtcat gaggatttag caaaggcgaa gttttacgtc gatagagtat ttgacttgtg 
36001 ggagtgatga ccatgacaga tagcggacgt aaagaatact taaaacattt tttcggctct 
36061 aagagatatc tgtatcagga taacgaacga gtggcacata tccatgtagt aaatggcact 
36121 tattactttc acggtcatat cgtgccaggt tggcaaggtg tgaaaaagac atttgataca 
36181 gcggaagagc ttgaaacata tataaagcaa agtgatttgg aatatgagga acagaagcaa 
36241 ctaactttat tttaaaaggg cggaaacaat gaaaatcaaa attgaaaaag aaatgaattt 
36301 acctgaactt atccaatggg cttgggataa ccccaagtta tcaggtaata aaagattcta 
36361 ttcaaatgat gttgagcgca actgttttgt gacttttcat gttgatagca tcttatgtaa 
36421 tgtgactgga tatgtatcaa ttaacgataa atttactgtt caagaggaga tataacaatg 
36481 aaaatcaaag ttaaaaaaga aatgagatta gatgaattaa ttaaatgggc gcgagaaaat 
36541 ccggatctat cacaaggaaa aatatttttt tcaacaggat ttagtgatgg attcgttcgt 
36601 tttcatccaa atacaaataa gtgttcgacg tcaagtttta ttccaattga tatccccttc 
36661 atagttgata ttgaaaaaga agtaacggaa gagactaagg ttgataggtt gattgaatta 
36721 ttcgagattc aagaaggaga ctataactct acactatatg agaacactag tataaaagaa 
36781 tgtttatatg gcagatgtgt gcctaccaaa gcattctaca tcttaaacga tgacctaact 
36841 atgacgttaa tctggaaaga tggggagttg ctagtatgat gttgaaattt aaagcttggg 
36901 ataaagataa aaaagttatg agtattattg acgaaatcga ttttaatagt gggtacattt 
36961 tgatttcaac aggttataaa agtttcaatg aagtaaaact attacaatac acaggattta 
37021 aagatgtgca cggtgtggag atttatgaag gggatattgt tcaagattgt tattcgagag 
37081 aagtaagttt tatcgagttt aaagaaggag ccttttatat aacttttagc aatgtaactg 
37141 aattactaag tgaaaatgac gatattattg aaattgttgg aaatattttt gaaaatgaga 
37201 tgctattgga ggttatgaga tgacgttcac cttatcagat gaacaatata aaaatctttg 
37261 tactaactct aacaagttat tagataaact tcacaaagca ttaaaagatc gtgaagagta 
37321 caagaagcaa cgagatgagc ttattgggga tatagcgaag ttacgagatt gtaacaaaga 
37381 tctagagaag aaagcaagcg catgggatag gtattgcaag agcgttgaaa aagatttaat 
37441 aaacgaattc ggtaacgatg atgaaagagt taaattcgga atggaattaa acaataaaat 
37501 ttttatggag gatgacacaa atgaataatc gcgaaaaaat cgaacagtcc gttattagtg 
37561 ctagtgcgta taacggtaat gacacagagg ggttgctaaa agagattgag gacgtgtata 
37621 agaaagcgca agcgtttgat gaaatacttg agggaatgac aaatgctatt caacattcag 
37681 ttaaagaagg tattgaactt gatgaagcag tagggattat ggcaggtcaa gttgtctata 
37741 aatatgagga ggaataggaa aatgactaac acattacaag taaaactatt atcaaaaaat 
37801 gctagaatgc ccgaacgaaa tcataagacg gatgcaggtt atgacatatt ctcagctgaa 
37861 actgtcgtac tcgaaccaca agaaaaagca gtgatcaaaa cagatgtagc tgtgagtata 
37921 ccagagggct atgtcggact attaactagt cgtagtggtg taagtagtaa aacgtattta 
37981 gtgattgaaa caggcaagat agacgcggga tatcatggca atttagggat taatatcaag 
38041 aatgatgaag aacgtgatgg aatacccttt ttatatgatg atatagacgc tgaattagaa 
38101 gatggattaa taagcatttt agatataaaa ggtaactatg tacaagatgg aagaggcata 
38161 agaagagttt accaaatcaa caaaggcgat aaactagctc aattggttat cgtgcctata 
38221 tggacaccgg aactaaagca agtggaggaa ttcgaaagtg tttcagaacg tggagcaaaa 
38281 ggcttcggaa gtagcggagt gtaaagacat cttagatcga gttaaggagg ttttggggaa 
38341 gtgacgcaat acttagtcac aacattcaaa gattcaacag gacgaccaca tgaacatatt 
38401 actgtggcta gagataatca gacgtttaca gttattgagg cagagagtaa agaagaagcg 
38461 aaagagaagt acgaggcaca agttaaaaga gatgcagtta ttaaagtggg tcagttgtat 
38521 gaaaatataa gggagtgtgg gaaatgacgg atgttaaaat taaaactatt tcaggtggag 
38581 tttattttgt aaaaacagct gaaccttttg aaaaatatgt tgaaagaatg acgagtttta 
38641 atggttatat ttacgcaagt actataatca agaaaccaac gtatattaaa acagatacga 
38701 ttgaatcaat cacacttatt gaggagcatg ggaaatgaat cagctgagaa ttttattaca 
Figure 2N 


38761 tgacggtagt agtttgatat tacatgaaga tgaattattt aacgaaatag tatttgtttt 
38821 ggacaatttt agaaatgatg atgactattt aacgatagaa aaagattatg gcagagaact 
38881 tgtattgaac aaaggttata tagttgggat caatgttgag gaggcagatg atgattaaca 
38941 tacctaaaat gaaattcccg aaaaagtaca ctgaaataat caaaaaatat aaaaataaag 
39001 cacctgaaga aaaggctaag attgaagatg attttattaa agaaattaaa gataaagaca 
39061 gtgaatttta cagtcctacg atggctaata tgaatgaata tgaattaagg gctatgttaa 
39121 gaatgatgcc tagtttaatt gatactggag atgacaatga tgattaaaaa acttaaaaat 
39181 atggatgggt tcgacatctt tattgttgga atactgtcat tattcggtat attcgcattg 
39241 ctacttgtta tcacattgcc tatctataca gtggctagtt accaacacaa agaattacat 
39301 caaggaacta ttacagataa atataacaag agacaagata aagaagacaa gttctatatt 
39361 gtattagaca acaaacaagt cattgaaaat tccgacttat tattcaaaaa gaaatttgat 
39421 agcgcagata tacaagctag gttaaaagta ggcgataagg tagaagttaa aacaatcggt 
39481 tatagaatac actttttaaa tttatatccg gtcttatacg aagtaaagaa ggtagataaa 
39541 caatgattaa acaaatacta agactattat tcttactagc aatgtatgag ttaggtaagt 
39601 atgtaactga gcaagtgtat attatgatga cggctaatga tgatgtagag gcgccgagtg 
39661 attacgtctt tcgagcggag gtgagtgaat aatgagaata tttatttatg atttgatcgt 
39721 tttgctgttt gctttcttaa tatccatata tattattgat gatggagtga taataaatgc 
39781 attaggaatt tttggtatgt ataaaattat agattccttt tcagaaaata ttataaagag 
39841 gtagataaaa atgaacgagc aaataatagg aagcatatat actttagcag gaggtgttgt 
39901 gctttattca gttaaagaga tttttaggta ttttacagat tctaacttac aacgtaaaaa 
39961 aatcaattta gaacaaatat atccgatata tttagattgt tttaaaaagg ctaaaaagat 
40021 gattggagct tatattattc caacagaaca gcatgaattt ttagattttt ttgatattga 
40081 agtctttaat aatttagata agcaaagtaa aaaagcgtat gaaaatgtta ttggatttag 
40141 acaaatgatt aatttatcaa atagagttaa ggcaatggaa gattttaaga tgagtttcaa 
40201 caatgaattt agtacaaatc agattttttt taatccttct tttgttatgg aaacaattgc 
40261 tattataaat gaatatcaaa aagatatatc ttatttaaaa aatataatta ataaaatgaa 
40321 tgaaaataga gcttataatc atattgatag ttttatcact tcagagtacc gacgaaaaat 
40381 aaacgattat aatctttatc ttgataaatt tgaagaacag tttagtcaaa agtttaaaat 
40441 aaacagaact tcgataaaag aaagaattat tattaattta aacaagagga gatttaaatg 
40501 atgtggatta ctatgactat tgtatttgct atattgctat tagtttgtat cagtattaat 
40561 agtgatcgtg caagagagat acaagcactt agatatatga atgattatct acttgatgaa 
40621 gtagttaaaa ctaaagggta caacgggtta gaagaataca ggattgaatt gaagcgaatg 
40681 aataacgata ttaaaaagta atttatatta tcggaggtat tgcattgaat gataaagatt 
40741 gagaaacacg atatcaaaaa gcttgaagaa tacattcagc acatcgataa ctatcgaaga 
40801 gagttgaaga tgcgagaata tgaattactt gaaagtcatg aaccagataa tgcgggagct 
40861 ggcaaaagta atttgccggg taacccgatt gaacgatgtg caataaagaa gtttagtgat 
40921 aacaggtaca atacattaag aaatatagtt aacggtgtag atagattgat aggtgaaagt 
40981 gatgaggata cgcttgagtt attaaggttt agatattggg attgtcctat tggttgttat 
41041 gaatgggaag atatagcaca ttactttggt acaagtaaga caagtatatt acgtagaagg 
41101 aatgcactga tcgataagtt agcaaagtat attggttatg tgtagcggac ttttacccta 
41161 tgtaagtccg cattaaaaca gtttattatg ttagtatcag attaatattt aaagttatta 
41221 aatgctaata cgacgcatga acaagaggcg catcactatg tgatgtgtct ttttatttat 
41281 gaggtatgaa catgttcaaa ctaattgtaa atacattact acacatcaag tatagatgag 
41341 tcttgatact acttaagtta tataaggtga aacattatga tgactaaaga cgaacgtata 
41401 cgattctata agtctaaaga atggcaaata acaagaaaaa gagtgctaga aagagataat 
41461 tatgaatgtc aacaatgtaa gagagacggc aagttaacga catatgacaa aagcaagcgt 
41521 aagtcgttgg atgtagatca tatattatcg ctagaacatc atccggagtt tgctcatgac 
41581 ttaaacaatt tagaaacact gtgtattaaa tgtcacaaca aaaaagaaaa gagatttata 
41641 aaaaaagaaa ataaatggaa agacgaaaaa tggtaaatac ccccgggtca aaaaaatcaa 
41701 aagcgat 
Figure 4 

SEQ ID NO: 4 (P77ORF104) 

1 atggtaacca aagaattttt aaaaactaaa cttgagtgtt cagatatgta cgctcagaaa 

61 ctcatagatg aggcacaggg cgatgaaaat aggttgtacg acctatttat ccaaaaactt 

121 gcagaacgtc atacacgccc cgctatcgtc gaatattaa 



SEQ ID NO: 5 (P77ORF104) 

1 MVTKEFLKTK LECSDMYAQK LIDEAQGDEN RLYDLFIQKL AERHTRPAIV EY 
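
The relationship between the two P77ORF104 listings above can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code and reading frame 1; the helper names `clean` and `translate` are illustrative and do not come from the figure. It strips the position numbers from SEQ ID NO: 4, translates up to the first stop codon, and compares the result with SEQ ID NO: 5.

```python
# Standard genetic code laid out in TCAG order (an assumption: the figure
# does not state which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(listing: str) -> str:
    """Drop position numbers and whitespace from a numbered listing."""
    return "".join(ch for ch in listing if ch.isalpha()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 up to (not including) the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Copied from Figure 4.
SEQ_ID_NO_4 = """
1 atggtaacca aagaattttt aaaaactaaa cttgagtgtt cagatatgta cgctcagaaa
61 ctcatagatg aggcacaggg cgatgaaaat aggttgtacg acctatttat ccaaaaactt
121 gcagaacgtc atacacgccc cgctatcgtc gaatattaa
"""
SEQ_ID_NO_5 = "MVTKEFLKTK LECSDMYAQK LIDEAQGDEN RLYDLFIQKL AERHTRPAIV EY"

orf = clean(SEQ_ID_NO_4)
protein = translate(orf)
print(len(orf), len(protein))
print(protein == clean(SEQ_ID_NO_5))
```

The same two helpers can be applied to any other numbered listing in these figures, provided the position labels have been transcribed without embedded letters.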
Figure 5 

Predicted tryptic peptide masses of conceptual ORF in Contig 1383: 

1 MGGGQSIMKqfkSIINTSQDFEKrIEKikK 30 

31 evindpdvkQFLEAHRaeltnamidedlnv 60 

61 lqeykDQQKhydghkFADCPNFVKghvpel 90 

91 yvdnnrIKirYLQCPCKikYDEERfeaeli 120 

121 tshhmqrDTLNAKlkDIYMNHRdrLDVAMA 150 

151 ADDICTAITNGEQVKglylygpfgtgkSFI 180 

181 LGAIANQLKskKvrSTIIYLPEFIRtlkGG 210 

211 FKdgsfekKlhrVReanilmlddigaeevt 240 

241 pwvrDEVIGPLLHYRmvhelptffssnfdy 270 

271 selehhlamtrDGEEKtkAARiierVKsls 300 

301 tpyflsgenfrNN 



Tryptic peptide fragment: 

GHVPELYVDNNR Predicted peptide mass MH+ = 1413.538 

STIIYLPEFIR Predicted peptide mass MH+ = 1352.6221 

SLSTPYFLSGENFR Predicted peptide mass MH+ = 1618.7923 
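
The three MH+ values above can be approximated from the peptide sequences alone. The sketch below is illustrative only: it assumes average (not monoisotopic) residue masses, the common rule that trypsin cleaves after K or R unless the next residue is P, and standard mass constants that are not taken from the figure. The function names are hypothetical. Under those assumptions the computed values fall within about 0.01 Da of the figure's numbers.

```python
# Average residue masses in daltons (standard reference values; assumed,
# since the figure does not state which mass table was used).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153   # average mass of H2O added to the residue sum
PROTON = 1.0073   # mass of the charging proton


def tryptic_digest(protein: str) -> list[str]:
    """Cleave after K or R, except when the next residue is P."""
    peptides, start = [], 0
    for i, residue in enumerate(protein):
        following = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and following != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])
    return peptides


def mh_plus(peptide: str) -> float:
    """Average-mass [M+H]+ of an unmodified peptide."""
    return sum(AVERAGE_RESIDUE_MASS[r] for r in peptide) + WATER + PROTON


# Residues 81-100 of SEQ ID NO: 2, which contain the first listed fragment.
print(tryptic_digest("NFVKGHVPELYVDNNRIKIR"))

for fragment in ("GHVPELYVDNNR", "STIIYLPEFIR", "SLSTPYFLSGENFR"):
    print(fragment, round(mh_plus(fragment), 3))
```

Monoisotopic masses would give values roughly 0.8 to 1.0 Da lower for these peptides, which is why average masses are assumed here.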
Figure 6A 

Optimal global alignment 

Sequence 1 SEQ ID NO: 6 DnaC nucleotide B. subtilis (1471 letters) 

Sequence 2 SEQ ID NO: 7 DnaC nucleotide S. aureus (1513 letters) 

seq1 1 AT-GACAGACCTTCTGAATGACCGGCTTC CTCCGCAAAATATAGAAGCCGAACAAGC 56 

II II Ml II llll I II Mill II I Mill Mill I 

seq2 1 ATGGATAGA ATGTATGAGCAAAATCAAATGCCGCATAACAATGAAGCTGAACAGTC 56 

seq1 57 CGTGTTAGGCGCTATTTTTTTACAGCC-GTCTGCTTTAACACTGGCTTCAGAAGTATTGA 115 

ii inn i mi ii ii i ii i i 1 1 1 1 mi ii mil mi 

seq2 57 TGTCTTAGGTTCAATTATTATAGATCCAGAATTGATTAATACT-ACTCAGGAAGTTTTGC 115 

seq1 116 TTCCAGATGATTTCTATAGAATGTCCCACCAAAAAATCTATAATGCGATGCTGGTGCTCG 175 

MM II II Mill llll Ml I II I III III II I 

seq2 116 TTCCTGAGTCGTTTTATAGGGGTGCCCATCAACATATTTTCCGTGCAATGATGCACTTAA 175 

seq1 176 GTGACCGAGGTGAACCGGTTGATCTGGTGACA--GTTACATCAGAGCTTGCGAACACAGA 233 

Ml I II Mill I II III I I Ml I I III M 

seq2 176 ATGAAGATAATAAAGAAATTGATGTTGTAACATTGATGGATC--AATTATCGACGGAAGG 233 

seq1 234 CCTGCTGGAAGAAGTAGGCGGTATTTCATAT-TTG-ACAGATATCGCAAACTCGGTGCCG 291 

I II I llll II II llll III I II llll II I ll ll 

seq2 234 TACGTTGAATGAAGCGGGTGGCCCGCAATATCTTGCAGAGTTATCTACAAAT--GTACCA 291 

seq1 292 ACAGCGGCTAACATAGAATATTACGCGAAAATCGTTGAGGAAAAATCGATT-CTTCGCCG 350 

II II llll Mill I I llllll II II III II I 

seq2 292 ACGACGCGAAATGTTCAGTATTATACTGATATCGTT-TCTAAGCATGCATTAAAACGTAG 350 

seq1 351 ATTAATCAGAACTGCGACAACGATTGCTCAAGACGGGTATACCCGTGAGGATGAGGTCGA 410 

Mill llllll I Mill I II II MM III III III 

seq2 351 ATTGATTCAAACTGCAGATAGTATTGCCAATGATGGATATAATGATGAACTTGAACTAGA 410 

seq1 411 --GGATTTACTCAGTGAAGCGGAAAAAACGATTATGGAAGTGGCA-CAGCGCAAAAACAC 467 

Mill I Mill II III III I II I II I I I II I 

seq2 411 TGCGATTT--TAAGTGATGCAGAACGTCGAATTTTAGAGCTATCATCTTCTCGTGAAAGC 468 

seq1 468 GAGTGCCTTCCAAAATATTAAGGACGTCCTTGTCCAGACCTATGATAATATC-GAACAGC 526 

II M Ml II I III MUM I I II Mill II I I III III 

seq2 469 GA-TGGCTTTAAAGACATTCGAGACGTCTTAGGACAAGTGTATGA-AACAGCTGAAGAGC 526 

seq1 527 TTTACAATCGAAAAGGTGAT--ATCA-CGGGAATTCCAACAGGGTTTACGGAGCTTGACC 583 

, ii mi in i ii i i ii i ii ii ii mil i i ii iiiii 

seq2 527 TT GATCAAAATAGTGGTCAAACACCAGGTATACCTACAGGATATCGAGATTTAGACC 583 

seq1 584 GGATGACTGCGGGTTTCCAGCGCAACGACTTGATCATTGTGGCTGCCCGTCCTTCAGTAG 643 

Mill II II Ml I II II II II II 11 I II II Mill II llll 

seq2 584 AAATGACAGCAGGGTTCAACCGAAATGATTTAATTATCCTTGCAGCGCGTCCATCTGTAG 643 

seq1 644 GGAAAACAGCCTTTGCCCTGAACATCGCACAAAACGTGGCGAC GAAGACCGATG-A 698 

I I I M II I I II I I II I I II I II II I I I I I I I II M I llll 

seq2 644 GTAAGACTGCGTTCGCACTTAATATTGCACAAAAAGTTGCAACGCATGAAGA--TATGTA 701 
Figure 6B 


seq1 699 GAGCGTAGCGATTTTCAGTCTTGAGATGGGTGCCGAGCAGCTCGTTATGCGTATGCTCTG 758 

i mi Mini ii i in ii n ii i ii mi i i i Mini i ii 

seq2 702 TACAGTTGGTATTTTCTCGCTAGAGATGGGTGCTGATCAGTTAGCCACACGTATGATTTG 761 

seq1 759 TGCCGAGGGAAATATCAATGCCCAGAATC TCCGTACAGGTAACCTGACCGAAGAGGA 815 

I MINI I III III I I II 1 1 1 1 II 1 1 II II II 

seq2 762 TAGTTCTGGAAATGT TGACTCAAACCGCTTAAGAACGGGTACTATGACTGAGGAAGA 818 

seq1 816 TTGGGGCAAGCTGACGATGGCAATGGGAAGCCTATCGAACAGCGGGATTTACATCGATGA 875 

III I I I I I I II I I I I I II I I I I I I I I I I I I I I 

seq2 819 TTGGAGTCGTTTTACTATAGCGGTAGGTAAATTATCACGTACGAAGATTTTTATTGATGA 878 

seq1 876 TACACCGGGTATTCGAGTGAGTGAAATCCGTGCCAAGTGCCGCCGCTTGAAGCAGGAAAG 935 

UN MM I I I III I III I M M II II II Mill III 

seq2 879 TACACCGGGTATTCGAATTAATGATTTACGTTCTAAATGTCGTCGATTAAAGCAAGAACA 938 

seq1 936 CGGGCTGGGCATGATTTTGATCGATTACCTGCAATTGATTCAGGGAAGCGGT CGTTC 992 

II I I 1 1 1 1 1 II MM II III I II 1 1 1 1 1 1 1 1 M II III III I 

seq2 939 TGGCTTAGACATGATTGTGATTGACTACTTACAGTTGATTCAAGGTAGTGGTTCACGTGC 998 

seq1 993 AAAGGACAACCGTCAGCAGGAAGTATCTGAAATTTCCCGTGAACTGAAGTCGATTGCGAG 1052 

III I II 1 1 1 1 1 1 1 1 MINIM II III I I M I I II I 

seq2 999 GTCCGATAACAGACAACAGGAAGTTTCTGAAATCTCTCGTACATTAAAAGCATTAGCCCG 1058 

seq1 1053 GGAGCTGCAAGTCCCTGTTATCGCGCTTTCTCAGCTTTCCAGGGGTGTTGAGCAGCGTCA 1112 

II I M M MIIIMI I MM I M I IMIMII II II M 

seq2 1059 TGAATTAGAATGTCCAGTTATCGCATTAAGTCAGTTATCTCGTGGTGTTGAACAACGACA 1118 

seq1 1113 GGATAAACGTCCGATGATGTCTGATATCCGGGAATCAGGAAGTATCGAGCAGGACGCGGA 1172 

MINIMI II I Ml I INN II Mill II II MM II II II 

seq2 1119 AGATAAACGTCCAATGATGAGTGATATTCGTGAATCTGGTTCGATTGAGCAAGATGCCGA 1178 

seq1 1173 TATTGTCGCGTTCCTTTATCGTGATGACTACT ATGA 1208 

III II II III I II l M l l 1 1 1 1 1 1 1 1 

seq2 1179 TATCGTTGCATTCTTATACCGTGATGATTACTATAACCGTGGCGGCGATGAAGATGATGA 1238 

seq1 1209 CAAAGA AACCGA--GAATAAAA--ATATTATCGAAATTATTAT 1247 

Ml || IN || MM | || MMIMI II 

seq2 1239 CGATGATGGTGGTTTCGAGCCACAAACGAATGATGAAAACGGTGAAATTGAAATTATCAT 1298 

seq1 1248 CGCCAAACAGCGTAACGGCCCGGTAGGAACCGTGTCTCTTGC-GTTCGTAAAAGAATACA 1306 

N N II MMIMI II IN || HI II I || | Ml INI I 

seq2 1299 TGCTAAGCAACGTAACGGTCCAACAGGCACAGT-TAAGTTACATTTTATGAAACAATATA 1357 

seq1 1307 ACAAATTCGTCAACCTGGAACGGCGTTTTGATGACGCAGGCGTTCCGCCCGGCGCA 1362 

I IN II II II Mill I I I 

seq2 1358 ATAAATT TACCGATATCG--ATTATGCACATGCAGATATGATG TAA 1401 
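
An optimal global alignment of the kind shown in Figures 6A and 6B is conventionally computed by dynamic programming (Needleman-Wunsch). The figures do not name the program or the scoring scheme that was used, so the following is only a minimal sketch under assumed parameters (match +1, mismatch -1, linear gap penalty -1); the function names are hypothetical and the output is not expected to reproduce the figure's gap placement.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Return (score, aligned_a, aligned_b) for one optimal global alignment."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            score[i][j] = max(diag, score[i - 1][j] + gap, score[i][j - 1] + gap)

    # Trace back from the bottom-right corner, preferring the diagonal move.
    top, bottom = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            step = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + step:
                top.append(a[i - 1])
                bottom.append(b[j - 1])
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            top.append(a[i - 1])
            bottom.append("-")
            i -= 1
        else:
            top.append("-")
            bottom.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(top)), "".join(reversed(bottom))


def percent_identity(top, bottom):
    """Share of alignment columns holding the same residue in both rows."""
    same = sum(1 for x, y in zip(top, bottom) if x == y and x != "-")
    return 100.0 * same / len(top)


# Short prefixes of SEQ ID NO: 6 and SEQ ID NO: 7, for illustration only.
score, top, bottom = needleman_wunsch("ATGACAGACCTTCTGAATGACC",
                                      "ATGGATAGAATGTATGAGCAAA")
print(score)
print(top)
print(bottom)
print(round(percent_identity(top, bottom), 1))
```

The table is quadratic in memory, which is acceptable for sequences of the length shown here (about 1.4 kb each) but would need a banded or linear-space variant for whole genomes.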
Figure 6C 

SEQ ID NO: 6 DnaC nucleotide B. subtilis 

1 ATGACAGACC TTCTGAATGA CCGGCTTCCT CCGCAAAATA TAGAAGCCGA 

51 ACAAGCCGTG TTAGGCGCTA TTTTTTTACA GCCGTCTGCT TTAACACTGG 

101 CTTCAGAAGT ATTGATTCCA GATGATTTCT ATAGAATGTC CCACCAAAAA 

151 ATCTATAATG CGATGCTGGT GCTCGGTGAC CGAGGTGAAC CGGTTGATCT 

201 GGTGACAGTT ACATCAGAGC TTGCGAACAC AGACCTGCTG GAAGAAGTAG 

251 GCGGTATTTC ATATTTGACA GATATCGCAA ACTCGGTGCC GACAGCGGCT 

301 AACATAGAAT ATTACGCGAA AATCGTTGAG GAAAAATCGA TTCTTCGCCG 

351 ATTAATCAGA ACTGCGACAA CGATTGCTCA AGACGGGTAT ACCCGTGAGG 

401 ATGAGGTCGA GGATTTACTC AGTGAAGCGG AAAAAACGAT TATGGAAGTG 

451 GCACAGCGCA AAAACACGAG TGCCTTCCAA AATATTAAGG ACGTCCTTGT 

501 CCAGACCTAT GATAATATCG AACAGCTTTA CAATCGAAAA GGTGATATCA 

551 CGGGAATTCC AACAGGGTTT ACGGAGCTTG ACCGGATGAC TGCGGGTTTC 

601 CAGCGCAACG ACTTGATCAT TGTGGCTGCC CGTCCTTCAG TAGGGAAAAC 

651 AGCCTTTGCC CTGAACATCG CACAAAACGT GGCGACGAAG ACCGATGAGA 

701 GCGTAGCGAT TTTCAGTCTT GAGATGGGTG CCGAGCAGCT CGTTATGCGT 

751 ATGCTCTGTG CCGAGGGAAA TATCAATGCC CAGAATCTCC GTACAGGTAA 

801 CCTGACCGAA GAGGATTGGG GCAAGCTGAC GATGGCAATG GGAAGCCTAT 

851 CGAACAGCGG GATTTACATC GATGATACAC CGGGTATTCG AGTGAGTGAA 

901 ATCCGTGCCA AGTGCCGCCG CTTGAAGCAG GAAAGCGGGC TGGGCATGAT 

951 TTTGATCGAT TACCTGCAAT TGATTCAGGG AAGCGGTCGT TCAAAGGACA 

1001 ACCGTCAGCA GGAAGTATCT GAAATTTCCC GTGAACTGAA GTCGATTGCG 

1051 AGGGAGCTGC AAGTCCCTGT TATCGCGCTT TCTCAGCTTT CCAGGGGTGT 

1101 TGAGCAGCGT CAGGATAAAC GTCCGATGAT GTCTGATATC CGGGAATCAG 

1151 GAAGTATCGA GCAGGACGCG GATATTGTCG CGTTCCTTTA TCGTGATGAC 

1201 TACTATGACA AAGAAACCGA GAATAAAAAT ATTATCGAAA TTATTATCGC 

1251 CAAACAGCGT AACGGCCCGG TAGGAACCGT GTCTCTTGCG TTCGTAAAAG 

1301 AATACAACAA ATTCGTCAAC CTGGAACGGC GTTTTGATGA CGCAGGCGTT 
1351 CCGCCCGGCG CA 
Figure 6D 

SEQ ID NO: 7 DnaC nucleotide S. aureus 

1 ATGGATAGAA TGTATGAGCA AAATCAAATG CCGCATAACA ATGAAGCTGA 

51 ACAGTCTGTC TTAGGTTCAA TTATTATAGA TCCAGAATTG ATTAATACTA 

101 CTCAGGAAGT TTTGCTTCCT GAGTCGTTTT ATAGGGGTGC CCATCAACAT 

151 ATTTTCCGTG CAATGATGCA CTTAAATGAA GATAATAAAG AAATTGATGT 

201 TGTAACATTG ATGGATCAAT TATCGACGGA AGGTACGTTG AATGAAGCGG 

251 GTGGCCCGCA ATATCTTGCA GAGTTATCTA CAAATGTACC AACGACGCGA 

301 AATGTTCAGT ATTATACTGA TATCGTTTCT AAGCATGCAT TAAAACGTAG 

351 ATTGATTCAA ACTGCAGATA GTATTGCCAA TGATGGATAT AATGATGAAC 

401 TTGAACTAGA TGCGATTTTA AGTGATGCAG AACGTCGAAT TTTAGAGCTA 

451 TCATCTTCTC GTGAAAGCGA TGGCTTTAAA GACATTCGAG ACGTCTTAGG 

501 ACAAGTGTAT GAAACAGCTG AAGAGCTTGA TCAAAATAGT GGTCAAACAC 

551 CAGGTATACC TACAGGATAT CGAGATTTAG ACCAAATGAC AGCAGGGTTC 

601 AACCGAAATG ATTTAATTAT CCTTGCAGCG CGTCCATCTG TAGGTAAGAC 

651 TGCGTTCGCA CTTAATATTG CACAAAAAGT TGCAACGCAT GAAGATATGT 

701 ATACAGTTGG TATTTTCTCG CTAGAGATGG GTGCTGATCA GTTAGCCACA 

751 CGTATGATTT GTAGTTCTGG AAATGTTGAC TCAAACCGCT TAAGAACGGG 

801 TACTATGACT GAGGAAGATT GGAGTCGTTT TACTATAGCG GTAGGTAAAT 

851 TATCACGTAC GAAGATTTTT ATTGATGATA CACCGGGTAT TCGAATTAAT 

901 GATTTACGTT CTAAATGTCG TCGATTAAAG CAAGAACATG GCTTAGACAT 

951 GATTGTGATT GACTACTTAC AGTTGATTCA AGGTAGTGGT TCACGTGCGT 

1001 CCGATAACAG ACAACAGGAA GTTTCTGAAA TCTCTCGTAC ATTAAAAGCA 

1051 TTAGCCCGTG AATTAGAATG TCCAGTTATC GCATTAAGTC AGTTATCTCG 

1101 TGGTGTTGAA CAACGACAAG ATAAACGTCC AATGATGAGT GATATTCGTG 

1151 AATCTGGTTC GATTGAGCAA GATGCCGATA TCGTTGCATT CTTATACCGT 

1201 GATGATTACT ATAACCGTGG CGGCGATGAA GATGATGACG ATGATGGTGG 

1251 TTTCGAGCCA CAAACGAATG ATGAAAACGG TGAAATTGAA ATTATCATTG 

1301 CTAAGCAACG TAACGGTCCA ACAGGCACAG TTAAGTTACA TTTTATGAAA 

1351 CAATATAATA AATTTACCGA TATCGATTAT GCACATGCAG ATATGATGTA 

1401 A 
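
A position-numbered listing such as the one above has to be joined into one contiguous string before it can be compared with other sequences or translated. The following is a minimal sketch of that step, not part of the original filing; the function names `parse_numbered_listing` and `looks_like_orf_end` are hypothetical. It drops every non-letter character, so position counters that were split during scanning (for example `13 51`) do not leak into the sequence.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_numbered_listing(text: str) -> str:
    """Join a position-numbered sequence listing into one contiguous string.

    All digits and whitespace are discarded, so counters such as "1351"
    or an OCR-split "13 51" are removed along with the block spacing.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def looks_like_orf_end(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Last two lines of the listing above, with a deliberately split counter.
tail = """
13 51 CAATATAATA AATTTACCGA TATCGATTAT GCACATGCAG ATATGATGTA
1401 A
"""
tail_seq = parse_numbered_listing(tail)
print(len(tail_seq), tail_seq[-3:], looks_like_orf_end(tail_seq))
```

Because position 1351 begins a codon, the 51 letters recovered here form 17 whole codons, the last of which is the stop codon.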
Figure 6E 

Sequence 1 SEQ ID NO: 8 DnaC B. subtilis (490 letters) 

Sequence 2 SEQ ID NO: 9 DnaC S. aureus (503 letters) 

seq1   1 MTDLLNDRLPPQNIEAEQAVLGAIFLQPSALTLASEVLIPDDFYRMSHQKIYNAMLVLGD 60 
seq2   1 MDRMYEQNQMPHNNEAEQSVLGSIIIDPELINTTQEVLLPESFYRGAHQHIFRAMMHLNE 60 

seq1  61 RGEPVDLVTVTSELANTDLLEEVGGISYLTDIANSVPTAANIEYYAKIVEEKSILRRLIR 120 
seq2  61 DNKEIDVVTLMDQLSTEGTLNEAGGPQYLAELSTNVPTTRNVQYYTDIVSKHALKRRLIQ 120 

seq1 121 TATTIAQDGYTREDEVEDLLSEAEKTIMEVAQRKNTSAFQNIKDVLVQTYDNIEQLYNRK 180 
seq2 121 TADSIANDGYNDELELDAILSDAERRILELSSSRESDGFKDIRDVLGQVYETAEELDQNS 180 

seq1 181 GDITGIPTGFTELDRMTAGFQRNDLIIVAARPSVGKTAFALNIAQNVATKTD-ESVAIFS 239 
seq2 181 GQTPGIPTGYRDLDQMTAGFNRNDLIILAARPSVGKTAFALNIAQKVATHEDMYTVGIFS 240 

seq1 240 LEMGAEQLVMRMLCAEGNINAQNLRTGNLTEEDWGKLTMAMGSLSNSGIYIDDTPGIRVS 299 
seq2 241 LEMGADQLATRMICSSGNVDSNRLRTGTMTEEDWSRFTIAVGKLSRTKIFIDDTPGIRIN 300 

seq1 300 EIRAKCRRLKQESGLGMILIDYLQLIQGSG-RSKDNRQQEVSEISRELKSIARELQVPVI 358 
seq2 301 DLRSKCRRLKQEHGLDMIVIDYLQLIQGSGSRASDNRQQEVSEISRTLKALARELECPVI 360 

seq1 359 ALSQLSRGVEQRQDKRPMMSDIRESGSIEQDADIVAFLYRDDYYDK 404 
seq2 361 ALSQLSRGVEQRQDKRPMMSDIRESGSIEQDADIVAFLYRDDYYNRGGDEDDDDDGGFEP 420 

seq1 405 ETENKN-IIEIIIAKQRNGPVGTVSLAFVKEYNKFVNLERRFDDAGVPPGA 454 
seq2 421 QTNDENGEIEIIIAKQRNGPTGTVKLHFMKQYNKFTDIDYAHADMM 466 
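
Identity marks for an aligned pair such as the seq1 and seq2 rows above can be recomputed directly from the two rows. The sketch below is illustrative only and marks exact identities; the similarity marks of the original figure would additionally need a substitution matrix, which is not assumed here. The function names `identity_line` and `percent_identity` are hypothetical.

```python
def identity_line(row1: str, row2: str) -> str:
    """Return '|' under identical residues and ' ' elsewhere.

    Both rows must already be aligned (same length, '-' for gaps).
    A gap never counts as a match.
    """
    if len(row1) != len(row2):
        raise ValueError("aligned rows must have equal length")
    return "".join(
        "|" if a == b and a != "-" else " " for a, b in zip(row1, row2)
    )


def percent_identity(row1: str, row2: str) -> float:
    """Identical columns as a percentage of all aligned columns."""
    marks = identity_line(row1, row2)
    return 100.0 * marks.count("|") / len(marks)


# Columns 11 to 23 of the first alignment block above.
seq1_part = "PQNIEAEQAVLGA"
seq2_part = "PHNNEAEQSVLGS"
marks = identity_line(seq1_part, seq2_part)
print(seq1_part)
print(marks)
print(seq2_part)
```

For this 13-column excerpt the two rows agree at 9 positions.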
Figure 6F 

SEQ ID NO: 8 DnaC B. subtilis 

1 MTDLLNDRLP PQNIEAEQAV LGAIFLQPSA LTLASEVLIP DDFYRMSHQK 

51 IYNAMLVLGD RGEPVDLVTV TSELANTDLL EEVGGISYLT DIANSVPTAA 

101 NIEYYAKIVE EKSILRRLIR TATTIAQDGY TREDEVEDLL SEAEKTIMEV 

151 AQRKNTSAFQ NIKDVLVQTY DNIEQLYNRK GDITGIPTGF TELDRMTAGF 

201 QRNDLIIVAA RPSVGKTAFA LNIAQNVATK TDESVAIFSL EMGAEQLVMR 

251 MLCAEGNINA QNLRTGNLTE EDWGKLTMAM GSLSNSGIYI DDTPGIRVSE 

301 IRAKCRRLKQ ESGLGMILID YLQLIQGSGR SKDNRQQEVS EISRELKSIA 

351 RELQVPVIAL SQLSRGVEQR QDKRPMMSDI RESGSIEQDA DIVAFLYRDD 

401 YYDKETENKN IIEIIIAKQR NGPVGTVSLA FVKEYNKFVN LERRFDDAGV 

451 PPGA 



SEQ ID NO: 9 DnaC S. aureus 



1 MDRMYEQNQM PHNNEAEQSV LGSIIIDPEL INTTQEVLLP ESFYRGAHQH 

51 IFRAMMHLNE DNKEIDVVTL MDQLSTEGTL NEAGGPQYLA ELSTNVPTTR 

101 NVQYYTDIVS KHALKRRLIQ TADSIANDGY NDELELDAIL SDAERRILEL 

151 SSSRESDGFK DIRDVLGQVY ETAEELDQNS GQTPGIPTGY RDLDQMTAGF 

201 NRNDLIILAA RPSVGKTAFA LNIAQKVATH EDMYTVGIFS LEMGADQLAT 

251 RMICSSGNVD SNRLRTGTMT EEDWSRFTIA VGKLSRTKIF IDDTPGIRIN 

301 DLRSKCRRLK QEHGLDMIVI DYLQLIQGSG SRASDNRQQE VSEISRTLKA 

351 LARELECPVI ALSQLSRGVE QRQDKRPMMS DIRESGSIEQ DADIVAFLYR 

401 DDYYNRGGDE DDDDDGGFEP QTNDENGEIE IIIAKQRNGP TGTVKLHFMK 

451 QYNKFTDIDY AHADMM 
Figure 7A 

Figure 8A 

GST GST/ORF104 
ACB 0 0.1 0.5 1.0 2.0 ACB 0 0.1 0.5 1.0 2.0 Mr 
Figure 8B 

GST GST/ORF104 
ACB 0 0.1 0.5 1.0 2.0 Mr ACB 0 0.1 0.5 1.0 2.0 
Figure 8C 

GST GST/ORF104 
Mr ACB 0 0.1 0.5 1.0 2.0 ACB 0 0.1 0.5 1.0 2.0 
Figure 8D 

Figure 9 



GST GST/ORF104 C L 

ACB 0 0.1 0.5 1.0 2.0 Mr ACB 0 0.1 0.5 1.0 2.0 2.0 2.0 
Figure 10 

Lys Extract FP/S Extract 

ACB 0 0.1 0.5 1.0 2.0 Mr ACB 0 0.1 0.5 1.0 2.0 
Figure 12A 

Cloning scheme: Staphylococcus aureus genomic DNA → PCR with dnaI-specific primers → digestion with EcoRI and BamHI → subcloning into pGADT7 vector. 

Restriction sites labeled in the diagram: EcoRI, BamHI. 




Jerry PELLETIER et al. 
Compositions and Methods.. 
USSN 09/689,852 



Figure 12B 



Cloning scheme: phage 77 DNA → PCR with 77ORF104-specific primers → digestion with BamHI and PstI → subcloning into pGBKT7 vector. 

Plasmid map labels: pGBK77ORF104, 7.5 kb, GAL4 DNA-BD, 77ORF104, TRP1, pUC ORI, 2µ ORI, KanR; restriction sites BamHI, PstI. 
Figure 12C 

GAD / dnaI: NO TRANSCRIPTION 
REPORTER GENES: LacZ, HIS3 AND ADE2 

77ORF104 / DNA-BD: NO TRANSCRIPTION 
REPORTER GENES: LacZ, HIS3 AND ADE2 

TRANSCRIPTION 
REPORTER GENES: LacZ, HIS3 AND ADE2 
Figure 12D 

Plates: SD plate without Trp and Leu; SD plate without Trp, Leu, His and Ade 

SD: synthetic medium, Trp: tryptophan, Leu: leucine, His: histidine, Ade: adenine 

1) pGBKT7-53 and pGADT7-T 

2) pGBKT7-53 and pGADdnaI 

3) pGBK77ORF104 and pGADT7-T 

4) pGBKT7-LAM and pCL1 

5) pGBK77ORF104 and pGADdnaI 

6) pGBKdnaI and pGAD77ORF104 
Figure 12E 

Luminescent Beta-Galactosidase Assay 

X-axis: Sample identification (samples 1 to 6) 






Figure 13 
Figure 14A 

Figure 14B Chymotrypsin 

Figure 14C 

Amino acid residues corresponding to interacting partial proteolytic fragments. 

Protease               Proteolytic fragment ID   ID of SEQ ID NO:2 fragment interacting with 77ORF104 
                       (from Fig. 14A, B)        From amino      to carboxyl 

Endoproteinase Glu-C   V24                       117             313 
                       V24                       119             313 

Chymotrypsin           C38                       12              313 
                       C25                       83              313 
                       C24                       77              305 
                       C23                       77              304 
                       C22                       116             313 
                       C21                       131             313 

SEQ ID NO:2            complete                  1               313 
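
The residue ranges tabulated for Figure 14C can be turned into fragment lengths with simple arithmetic. The sketch below is an illustrative calculation rather than an analysis from the filing. It also computes the span shared by every listed fragment, and that shared span is an assumption-laden summary, not a claim about the minimal interacting region. The names `FRAGMENTS`, `fragment_length` and `shared_region` are hypothetical.

```python
FULL_LENGTH = 313  # length of SEQ ID NO:2 as given in the table

# (protease, fragment ID, first residue, last residue), copied from the table.
FRAGMENTS = [
    ("Endoproteinase Glu-C", "V24", 117, 313),
    ("Endoproteinase Glu-C", "V24", 119, 313),
    ("Chymotrypsin", "C38", 12, 313),
    ("Chymotrypsin", "C25", 83, 313),
    ("Chymotrypsin", "C24", 77, 305),
    ("Chymotrypsin", "C23", 77, 304),
    ("Chymotrypsin", "C22", 116, 313),
    ("Chymotrypsin", "C21", 131, 313),
]


def fragment_length(start: int, end: int) -> int:
    """Number of residues in an inclusive, 1-based residue range."""
    return end - start + 1


def shared_region(fragments):
    """Residue range present in every fragment, or None if there is none."""
    start = max(f[2] for f in fragments)
    end = min(f[3] for f in fragments)
    return (start, end) if start <= end else None


for protease, frag_id, start, end in FRAGMENTS:
    n = fragment_length(start, end)
    print(f"{protease:22s}{frag_id:5s}{start:4d}-{end:<4d}{n:4d} aa "
          f"({100 * n / FULL_LENGTH:.0f}% of full length)")

print("shared by all fragments:", shared_region(FRAGMENTS))
```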
Figure 15 



SEQ ID NO: 16 

>S. aureus DnaI: amino acids 150-313 

AADDICTAITNGEQVKGLYLYGPFGTGKSFILGAIANQLKSKKVRSTIIYLPEFIRTLKG 
GFKDGSFEKKLHRVREANILMLDDIGAEEVTPWVRDEVIGPLLHYRMVHELPTFFSSNFD 
YSELEHHLAMTRDGEEKTKAARIIERVKSLSTPYFLSGENFRNN 



SEQ ID NO: 17 

>S. aureus dnaI: nucleotides 448-942 

gcagcagatgatatttgtacagcaataactaatggggaacaagtgaaaggcctttacctt 
tatggtccatttgggacaggtaaatcttttattctaggtgcaattgcgaatcagctcaaa 
tctaagaaggtacgttcgacaattatttatttaccggaatttattagaacattaaaaggt 
ggctttaaagatggttcttttgaaaagaaattacatcgcgtaagagaagcaaacatttta 
atgcttgatgatattggggctgaagaagtgactccatgggtgagagatgaggtaattgga 
cctttgctacactatcgaatggttcatgaattaccaacattctttagttctaattttgac 
tatagtgaattggaacatcatttagcgatgactcgtgatggtgaagagaagactaaagca 
gcacgtattattgaacgtgtcaaatctttgtcaacaccatactttttatcaggagaaaat 
ttcagaaacaattga 



SEQ ID NO: 18 

>S. aureus DnaI: amino acids 64-313 

YKDQQKHYDGHKFADCPNFVKGHVPELYVDNNRIKIRYLQCPCKIKYDEERFEAELITSH 
HMQRDTLNAKLKDIYMNHRDRLDVAMAADDICTAITNGEQVKGLYLYGPFGTGKSFILGA 
IANQLKSKKVRSTIIYLPEFIRTLKGGFKDGSFEKKLHRVREANILMLDDIGAEEVTPWV 
RDEVIGPLLHYRMVHELPTFFSSNFDYSELEHHLAMTRDGEEKTKAARIIERVKSLSTPY 
FLSGENFRNN 
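
The residue ranges and nucleotide ranges quoted in Figure 15 are related by codon arithmetic: residue n of the protein is encoded by nucleotides 3(n-1)+1 through 3n of the coding sequence. A minimal sketch of that conversion follows. The function name `aa_to_nt_range` and its `include_stop` flag are hypothetical, and the sketch assumes the coding sequence starts at nucleotide 1 and is followed directly by one stop codon.

```python
def aa_to_nt_range(aa_start: int, aa_end: int, include_stop: bool = False):
    """Map an inclusive 1-based residue range to its nucleotide range.

    Assumes the coding sequence begins at nucleotide 1. With
    include_stop=True the three nucleotides after the last residue
    are included as well.
    """
    if aa_start < 1 or aa_end < aa_start:
        raise ValueError("expected 1 <= aa_start <= aa_end")
    nt_start = 3 * (aa_start - 1) + 1
    nt_end = 3 * aa_end + (3 if include_stop else 0)
    return nt_start, nt_end


# Residues 150-313 plus the stop codon correspond to nucleotides 448-942,
# the range given for SEQ ID NO: 17.
print(aa_to_nt_range(150, 313, include_stop=True))
# The same calculation for the 64-313 fragment of SEQ ID NO: 18.
print(aa_to_nt_range(64, 313, include_stop=True))
```

The first call reproduces the 448-942 range stated for SEQ ID NO: 17, which spans 495 nucleotides: 164 codons plus the stop codon.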
TL minus SD medium; THAL minus SD medium 

1. pGADDnaI(150-313) and pGBKORF104 

2. pGADDnaI(150-313) and pGBKLam 

3. pGADDnaI(64-313) and pGBKORF104 

4. pGADDnaI(64-313) and pGBKLam 

5. pGADDnaI and pGBKORF104 

6. 77pGADORF12 and pGBKORF104 
Figure 16B 